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PRODUCTION OF ATTENUATED PARAINFLUENZA VIRUS 
VACCINES FROM CLONED NUCLEOTIDE SEQUENCES 

10 

Related Applications 

The present application is a continuation-in-part 
application of, and claims the benefit under Title 35 of U.S. 
15 Provisional Application No. 60/047,575, filed May 23, 1997, 
and U.S. Provisional Application No. 60/059,385, filed 
September 19, 1997, each of which is incorporated herein by- 
reference . 

20 Background of the Invention 

Human parainfluenza viruses (HPIV) , HPIVl, HPIV2, 
and HPIV3 are significant causes of bronchiolitis, croup and 
pneumonia in infants and children. Karron et al., J. Infect. 
Dis. 172: 1445-50 (1995); Collins et al. "Parainfluenza 

25 Viruses", p. 1205-1243. In B. N. Fields et al . , eds . , Fields 
Virology > 3rd ed, vol. 1. Lippincott -Raven Publ . , Philadelphia 
(1996); Murphy et al . , Virus Res. 11:1-15 (1988). Infections 
by these viruses result in substantial morbidity in children 
less than 3 years of age, and are responsible for 

30 approximately 20% of hospitalizations among young infants and 
children for respiratory tract infections. 

Despite considerable efforts to develop effective 
vaccine therapies against HPIV, no approved vaccine agents 
have yet been achieved for any HPIV strain, nor for 

r 

35 ameliorating HPIV related illnesses. To date, only two live 
attenuated PIV vaccine candidates have received particular 
attention. One of these candidates is a bovine PIV (BPIV) 
strain that is antigenically related to HPIV3, and which has 
been shown to protect animals against HPIV3 . BPIV3 is 

40 attenuated, genetically stable and immunogenic in human 

infants and children (Karron et al., J. Inf. Dis. 171:1107-14 
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(1995a); Karron et al., J. Inf. Pis. 172:1445-1450, (1995b)). 
A second PIV3 vaccine candidate; JS cp45 is a cold-adapted 
mutant of the JS wildtype (wt) strain of HPIV3 (Karron et al., 
(1995b), supra ; Belshe et al., J. Med. Virol. 10:235-42 
5 (1982)). This live, attenuated, cold-passaged (cp) PIV3 
vaccine candidate exhibits temperature-sensitive (ts), 
cold-adaptation (ca) , and attenuation (att) phenotypes which 
are stable after viral replication in vivo. The cp45 virus is 
protective against human PIV3 challenge in experimental 

10 animals and is attenuated, genetically stable, and immunogenic 
in seronegative human infants and children (Hall et al., 
Virus Res. 22:173-184 (1992); Karron et al., (1995b), supra . 

HPIV3 is a member of the Paramyxovirus genus of the 
Paramyxovirus family, order Mononegavirales . Its genome is a 

15 single strand of negative-sense RNA 15462 nucleotides (nt) in 
length (Galinski et al., Virology 165 : 499-510, (1988); Stokes 
et al., Virus Res. 25:91-103 (1992)) and encodes at least 
eight proteins (Collins et al., supra , (1996); Galinski, 
supra . (1991) ; Spriggs and Collins, J . Gen . Virol . 67: 2705- 

20 2719, (1986)). Three of these proteins are associated with 
the RNA genome to form the nucleocapsid; namely the 
nucleocapsid protein N, phosphoprotein P, and large polymerase 
subunit L. Three additional proteins are associated with the 
envelope, namely the matrix protein M, taught to mediate viral 

25 attachment and release, the hemagglutinin-neuraminidase 

protein HN, and the fusion protein F. Two other proteins, HN 
and F, represent the neutralizing and protective antigens of 
PIVs (Collins et al . In Fields Virology . 3rd ed., 1:1205-43 
(1996)). Significant sequence divergence in these two 

30 protective antigens among different PIVs is the basis for the 
type specificity of protective immunity against these 
pathogens ( id. ) . 

Another protein of PIV, the C protein, is encoded by 
an overlapping open reading frame (ORF) of the P protein mRNA 

35 (Spriggs and Collins, 1986), and the D protein is generated by 
RNA editing of the P cistron (Galinski et al. Virolocrv 
186:543-50 (1992)) . The P mRNA also contains an internal ORF 
which has the potential to encode *a cystein-rich domain called 
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V. The V ORF is also found in other paramyxoviruses and 
typically is accessed by RNA editing, but this is not the case 
with PIV. Presently, it is not known whether the PIV V ORF is 
expressed. 

5 The viral genome of PIV also contains extragenic 

leader and trailer regions, possessing promoters required for 
viral replication and transcription. Thus, the PIV genetic 
map is represented as 3' leader-N-P/C/D-M-F-HN-L-trailer . 
Transcription initiates at the 3' end and proceeds by a 

10 sequential stop-start mechanism that is guided by short 

conserved motifs found at the gene boundaries. The upstream 
end of each gene contains a gene-start (GS) signal, which 
directs initiation of its respective mRNA. The downstream 
terminus of each gene contains a gene-end (GE) motif which 

15 directs polyadenylation and termination. 



other PIV3 vaccine candidates is of interest for a variety of 
reasons. In particular, it will be useful to understand the 
genetic basis of attenuation and to characterize the molecular 

20 virology and pathogenesis of attenuated HPIV3 strains to 

provide clinically acceptable vaccines for use against these 
and other paramyxoviruses, especially HPIVl and HPIV2 which 
together account for an additional 7% of pediatric hospital 
admissions. To achieve these and related goals, a method for 

25 producing virus with a wt phenotype from cDNA is needed to 

determine which mutation (s) in the cp45 virus specify the ts, 
ca and att phenotypes and which gene(s) of the BPIV3 specify 
the attenuation phenotype. 



30 PIV3 strain, and of the JS wt HPIV3 and cp45 strains have been 
determined (Stokes et al., supra . . (1992); Stokes et al., 
Virus Res . 30: 43-52 (1993)). From these studies, the cp45 
strain was shown to possess at least seventeen nucleotide 
substitutions compared to the parental JS wt HPIV3 strain, 

35 eight of which are correlated with changes to viral proteins. 
However, it has not been previously shown which of these 
identified mutations specify desired, e.g., ts, ca, and att, 
phenotypes. Recently, growth of cp45 at nonpermissive 



Identification of attenuating mutations in cp45 and 



The complete nucleotide sequences of the prototype 
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temperatures was reported to be complemented by coexpression 
of a cDNA clone of the L gene of the 47885 wt PIV3 strain (Ray 
et al., J> Virol. 70:580-584 (1996)), suggesting that the L 
gene may contain one or more mutations which contribute to the 
5 ts phenotype of cp45. However, the results of this study are 
complicated by the fact that the 47885 strain is not isogenic 
with the JS parent of cp45 (for example, the two viruses are 
4% different at the nucleotide level, and the L proteins 
differ at 41 amino acid positions (Stokes et al., supra . 

10 (1992); published erratum appears in Virus Res. 27:96 (1993); 
Virus Res. 25:91-103. Also, this method of complementation 
does not provide a clear measurement of the relative 
contribution of the L gene mutation (s) to the overall ts 
phenotype of cp45. 

15 Rescue and analysis of attenuating mutations in PIV3 

and other RNA viruses require effective methods to manipulate 
cDNAs for the particular viruses of interest. Despite 
previous advancements identifying cDNAs for PIV, manipulation 
of the genomic RNA of this and other negative -sense RNA 

20 viruses has proven difficult. One major obstacle in this 
regard is that the naked genomic RNA of these viruses is 
noninfectious . 

Successful methods for direct genetic manipulation 
of non- segmented negative strand RNA viruses have only 

25 recently begun to be developed (for reviews, see Conzelmann, 
J. Gen. Virol. 77:381-89 (1996); Palese et al . , Proc. Natl. 
Acad. Sci. U.S.A. 93:11354-58. (1996)). Functional 
nucleocapsids have been successfully generated from the 
intracellular coexpression of separately transfected plasmids 

30 bearing the T7 RNA polymerase promoter and encoding either 

genomic or antigenomic RNA and the N, P, and L proteins. The 
intracellular cDNA expression is driven by T7 RNA polymerase 
which is produced by co- infecting with a vaccinia recombinant 
virus. This approach was first used to determine the 

35 transcription and replication requirements of cDNA-encoded 
minireplicons , Some success has been achieved in the 
application of these general methods to rescue infectious 
rabies virus, vesicular stomatitis virus (VSV) , measles virus, 
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and Sendai virus from cDNA-encoded antigenomic RNA in the 
presence of the nucleocapsid N, phosphoprotein P, and large 
polymerase subunit L (Garcin et al . , EMBO J. 14:6087-6094 
(1995); Lawson et al . , Proc. Natl. Acad, Sci . U.S.A. 
5 92:4477-81 (1995); Radecke et al., EMBO J. 14:5773-5784 

(1995); Schnell et al . , EMBO J. 13:4195-203 (1994); Whelan et 
al., Proc. Natl. Acad. Sci. U.S.A. 92:8388-92 (1995)). 
Respiratory synctial virus (RSV) has also been recovered from 
cDNA encoded antigenome but this required the transfection of 

10 an additional plasmid encoding the M2 ORF 1 transcription 
elongation factor (Collins et al., 1995). 

Rescue of infectious PIV virus and other 
Mononegavirales members is complicated by virtue of their non- 
segmented negative -strand RNA genome. The genomic 

15 ribonucleoprotein complexes- (RNPs) of segmented genome 

viruses, such as influenza, are generally small in size and 
loosely structured, and can be assembled in vitro from RNA and 
required viral proteins. However, PIV and other 
Mononegavirales members feature much larger and more tightly 

20 structured RNPs, which tend to be refractory to functional 
association in vitro. Furthermore, the coding potential of 
HPIV3 P mRNA is complicated by cotranscriptional "RNA editing" 
(Galinski et al., Virology 186 ; 543-50 (1992)). The resultant 
shifts in reading frame can access internal ORFs which are 

25 expressed as chimeras fused to the N- terminal part of P. In 
addition, HPIV3 appears to differ from most other 
paramyxoviruses which express a chimeric V protein, as noted 
above. The corresponding set of proteins from HPIV3 editing 
has not yet been identified, and the internal V ORF of HPIV3 

30 is separated from the editing site by numerous translational 
stop codons (Galinski et al. (1992, supra ) . Yet another 
complicating factor is that editing by BPIV3 and HPIV3 
produces a novel chimeric protein D, in which the upstream 
half of P is fused to a domain encoded by a second internal 

35 ORF (Pelet et al., EMBO J. 10: 443-448 (1991); Galinski et 
al., supra , (1992)) . The D protein does not have a 
counterpart in other paramyxoviruses. 
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In view of the foregoing, an urgent need exists in 
the art for tools and methods to engineer safe and effective 
vaccines to alleviate the serious health problems attributable 
to PIV, particularly illnesses among infants and children 
5 attributable to HPIV3 . Quite surprisingly, the present 
invention satisfies these and other related needs. 

Summary Of The Invention 
The present invention provides novel tools and 

10 methods for introducing defined, predetermined structural and 
phenotypic changes into infectious PIV. In one embodiment of 
the invention, an isolated polynucleotide molecule is provided 
which comprises an operably linked transcriptional promoter, a 
polynucleotide sequence encoding a PIV genome or antigenome, 

15 and a transcriptional terminator. 

The PIV genome or antigenome can be a human or 
nonhuman PIV sequence, or a recombinantly modified version 
thereof. In one embodiment, the polynucleotide sequence 
encodes a chimeric genome or antigenome comprising a human PIV 

20 sequence recombinantly joined with a nonhuman PIV sequence, 
such as a gene or gene segment from bovine PIV (BPIV) . In 
additional examples, the polynucleotide encodes a chimera of 
sequences from a nonhuman PIV and at least one other PIV of 
human or nonhuman origin. 

25 In other embodiments, the invention provides an 

isolated infectious PIV particle comprising a recombinant PIV 
(rPIV) genome or antigenome. The isolated infectious PIV 
particle can be a viral or subviral particle. As used herein, 
subviral particle refers to any infectious PIV particle which 

30 lacks a structural element, eg., a gene segment, gene, 

protein, or protein functional domain, which is present in a 
complete virus (eg., an assembled virion including a complete 
genome or antigenome, nucleocapsid and envelope) . Thus, one 
example of a subviral particle of the invention is an 

35 infectious nucleocapsid containing a genome or antigenome, and 
the products of N, P, and L genes. Other subviral particles 
are produced by partial or complete deletions or substitutions 
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of non-essential genes and/or their products (eg., F, HN, M, 
or C) , among other non-essential structural elements. 

The isolated infectious PIV particle is preferably a 
human PIV, more preferably human PIV3 (HPIV3) . The invention 
5 also provides isolated, infectious particles from bovine or 
murine PIV (BPIV or MPIV) , as well as particles comprising 
chimeric sequences from two or more different PIV genomes, for 
example particles incorporating polynucleotide sequences from 
HPIV3 and HPIVl, from HPIV3 and HPIV2 sequences, or comprised 

10 of HPIV3 and BPIV sequences. 

In related aspects of the invention, isolated, 
infectious PIV particles are provided which incorporate 
nucleotide sequences from HPIV3 joined to at least one 
sequence from a heterologous PIV, such as HPIVl, HPIV2, BPIV 

15 or MPIV. For example, entire genes of HPiy3 may be replaced 
by counterpart genes from other forms of PIV, such as the HN 
and/or F glycoprotein genes of PIVl or PIV2 . Alternatively, a 
selected gene segment, for example a cytoplasmic tail, 
transmembrane domain or ectodomain of HN or F of HPIVl or 

20 HPIV2, can be substituted for a corresponding gene segment in 
a counterpart HPIV3 gene to yield constructs encoding chimeric 
proteins, e.g* fusion proteins having a cytoplasmic tail 
and/or transmembrane domain of PIV3 fused to an ectodomain of 
PIVl or PIV2 . Alternatively, genes or gene segments from one 

25 PIV can be added (i.e., without substitution) within a 
heterologous PIV background to create novel immunogenic 
properties within the resultant clone. 

Other modifications can be produced by introducing 
into a PIV genome or antigenome a nucleotide insertion, 

30 rearrangement, deletion or substitution selected to encode a 
desired phenotypic alteration, such as one that results in 
at t enuat ion , temperature - sens i t ivi ty , cold - adapt at ion , smal 1 
plaque size, host range restriction, improved growth in 
vitro . or a change in an immunogenic epitope of PIV. In one 

35 aspect of the invention, mutations occurring in biologically 
derived, attenuated PIV are identified and introduced 
individually or in combination into a full-length PIV clone. 
Typically these mutations are single amino acid changes 
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displayed by biologically derived mutant viruses over a wild- 
type PIV, for example changes exhibited by PIV mutants having 
ts, ca or att phenotypes. These changes from biologically 
derived mutant PIV are incorporated into a recombinant PIV 
5 clone to specify desired characteristics in the resultant 

virus. Exemplary mutations include amino acid changes which 
specify an attenuated phenotype in the HPIV3 strain JS cp45. 
Among these exemplary mutations are mutations occurring within 
the PIV polymerase gene L specifying ts, ca or att phenotypes, 

10 for example amino acid substitutions occurring at Tyr942/ 

Leu992# and/or Thriggg of the JS wild type PIV strain. In more 
detailed aspects, attenuated PIV recombinants are described 
wherein Tyrg^z is replaced by His, Leu992 is replaced by Phe, 
and/or Thrissg is replaced by lie. 

15 Also provided within the invention are recombinant 

PIV having multiple, phenotype- specifying mutations introduced 
in selected combinations into the genome or antigenome of an 
infectious clone to yield desired characteristics including 
attenuation, temperature sensitivity, cold-adaptation, small 

20 plaque size, host range restriction, etc. For example, PIV 
clones are provided which incorporate at least two separate 
mutations adopted from a biologically derived PIV mutant, 
e.g., two ts mutations from HPIV3 JS cp45. Multiply 
attenuated viruses are thus obtained by selecting mutations 

25 from a "menu" of identified lesions and introducing these 

mutations in various combinations to calibrate a vaccine virus 
to selected levels of attenuation, immunogenicity and 
stability. 

In additional embodiments, the invention provides 
30 for supplementation of one or more mutations adopted from 

biologically derived PIV, e.g., ts, ca or att mutations, with 
additional types of mutations involving the same or different 
genes. Target genes for mutation in this context include the 
nucleocapsid protein N, phosphoprotein P, large polymerase 
35 subunit L, matrix protein M, hemagglutinin-neuraminidase 

protein HN, fusion protein F and the C, D and V ORF products. 
In preferred aspects, attenuating mutations adopted from 
biologically derived PIV are incorporated within a chimeric 
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PIV recombinant, e.g., a PIV recombinant having nucleotide 
sequences from both HPIV3 and HPIVl, or from both HPIV and 
BPIV viruses. 

In other embodiments, the invention provides methods 
5 for producing an infectious PIV particle, e.g, a viral or 

subviral particle, from one or more isolated polynucleotide 
molecules encoding a PIV genome or antigenome (see also 
copending United States provisional patent application No. 
60/047575, filed May 23 1997, incorporated herein by reference 

10 in its entirety) . To produce an infectious PIV particle 

according to these methods, an expression vector comprising an 
isolated polynucleotide molecule encoding a PIV genome or 
antigenome is coexpressed in a cell or cell-free system with 
an expression vector comprising one or more isolated 

15 polynucleotide molecules encoding N, P, and L proteins of a 
PIV, whereby an infectious PIV particle is produced. 

The PIV genome or antigenome and the N, P, and L 
proteins may be coexpressed by a single expression vector, or 
by separate expression vectors. In alternate embodiments, the 

20 N, P, and L proteins are each encoded on separate expression 
vectors . 

Within the aforementioned methods, the 
polynucleotide molecule encoding the PIV genome or antigenome 
may correspond to a genomic or antigenomic sequence of human, 

25 bovine or murine PIV. Alternatively, the PIV encoding 

polynucleotide may be a chimera of a human PIV genomic or 
antigenomic sequence and at least one non-human PIV genomic or 
antigenomic sequence. In additional methods for producing 
infectious PIV, the polynucleotide encoding the PIV genome or 

30 antigenome is a chimera of two or more human PIV genomes, for 
example a polynucleotide containing sequences from HPIV3 
joined to sequences from one or more related forms of human 
PIV, such as human PIVl or human PIV2 . Individual genes of 
human PIV3 may be substituted by counterpart genes from 

35 heterologous PIV, for example the HN and F glycoprotein genes 
of PIVl or PIV2, to yield a modified genome or antigenome 
encoding a chimeric PIV. Alternatively, a selected, 
heterologous gene segment, such as a cytoplasmic tail, 
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transmembrane domain or ectodomain of HN or F of HPIVl or 
HPIV2, can be substituted for a counterpart gene segment in a 
different PIV type or different gene, e.g., HN or F of HPIV3, 
to yield constructs encoding chimeric proteins, e.g. fusion 
5 proteins having a cytoplasmic tail and/or transmembrane domain 
of PIV3 fused to an ectodomain of PIVl or PIV2 . 

In yet additional methods for producing infectious 
PIV, the PIV genome or antigenome is modified to yield a 
chimera of a human PIV genomic or antigenomic sequence and at 

10 least one non-human PIV sequence, for example a polynucleotide 
containing sequences from both human and bovine PIV. 

In other methods for producing infectious PIV, the 
PIV genome or antigenome is modified by a nucleotide 
insertion, rearrangement, deletion or substitution selected to 

15 encode a desirable phenotypic alteration, such as one that 
results in attenuation, temperature -sensitivity, cold- 
adaptation, small plaque size, host range restriction, or a 
change in an immunogenic epitope of PIV. Alternatively, the 
polynucleotide molecule encoding the PIV genome or antigenome 

20 can be modified to encode non-PIV molecules, e.g., a cytokine, 
a T-helper epitope, a restriction site marker, or a protein of 
a different microbial pathogen (e.g., virus, bacterium or 
fungus) capable of eliciting a protective immune response in 
the intended host. In one embodiment, the PIV genome or 

25 antigenome is modified to encode protein from a human RSV or 
from measles virus . 

In other embodiments of the invention a cell or 
cell-free expression system (e.g., a cell-free lysate) is 
provided which incorporates an expression vector comprising an 

30 isolated polynucleotide molecule encoding a PIV genome or 

antigenome, and an expression vector comprising one or more 
isolated polynucleotide molecules encoding N, P, and L 
proteins of a PIV. Upon expression, the genome or antigenome 
and N, P, and L proteins combine to produce an infectious PIV 

35 particle, such as a viral or subviral particle. The isolated 
polynucleotide molecules encoding the PIV genome or antigenome 
and the one or more isolated polynucleotide molecules encoding 
N, P, and L proteins of PIV can be expressed by a single 



# 
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vector, or the genome and one or more of the N, P, and L 
proteins can be incorporated into two or more separate 
vectors . 



5 Brief Description of the Drawings 

Figure 1 illustrates construction of the plasmid 
p218(131), encoding a complete copy of HPIV3 genomic RNA. The 
diagram of p218 {A*A'CC'D*E) (top left) illustrates the 
location of the T7 terminator and delta ribozyme adjacent to 

10 nucleotide 1 of the HPIV3 genome. The top right diagram of 

p(E*FF'GHIJKL*) illustrates the locations of the 7 sequencing 
markers (asterisk), see Fig. 2 below, and the T7 promoter 
adjacent to nucleotide 15462 of HPIV3 . The 15 overlapping 
clones used to construct the full length genomic cDNA are 

15 indicated by letters on the outside of the two plasmids. The 
unique restriction sites NgoMl and Xhol were used to clone 
p218(131) which encodes the complete negative- sense genomic 
RNA of HPIV3 illustrated at the bottom. The location of each 
sequencing marker in p218(131) differentiating recombinant JS 

20 from JS wt is indicated by an asterisk. 

Figure 2 depicts sequence markers in cDNA encoded 
genomic RNA of the HN (1) and L{2) genes of HPIV3. Sequences 
are negative sense, mutations are boxed. 

1) A non-coding nt substitution at 7903 removes a 

25 Hga 1 site from JS wt . Nucleotide substitutions in JS cDNA at 
7913 and 7915 create a Sea I site and change amino acid 370 
from proline to threonine, ablating the antibody epitope 
recognized by mAb 170/7 and 423/6. Also, the HN gene was 
found to contain an additional point mutation at nt 7593 which 

30 had not been recognized previously. This results in a 

threonine to isoleucine change at amino acid position 263 in 
the HN protein - 

2) The three incidental non-coding mutations in the 
L gene of JS cDNA that occurred during plasmid construction or 

35 assembly are boxed. 

Figure 3 includes a diagram (not to scale) of the 
PIV3-CAT cDNA (top, drawn as a circular plasmid) and the 
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encoded rainigenome RNA (bottom, drawn as a single strand of 
negative-sense RNA, 3' to 5'). 

Plasmid: The T7 promoter is shown as a black arrow 
pointing in the direction of transcription. Positions are 
5 shown for the delta ribozyme, the T7 transcriptional 

terminator, the inserted vaccinia virus transcriptional 
terminators (T^CT and TgAT) , restriction enzyme sites, and the 
coding sequence for the various domains of the encoded 
minigenome . 

10 Minigenome RNA : The 5' (right hand) end of the 

minigenome is defined by the promoter for T7 RNA polymerase, 
which contributes an extension of two nonviral G residues. 
These are not included in length calculations. The 3' end is 
defined by the ribozyme and is free of any heterologous 

15 nucleotides. PIV3~specif ic regions are shown as open boxes. 
The CAT ORF is shaded and the negative- sense complements of 
the translational initiation (UAC) and termination (AUU) 
codons are shown. Nucleotide positions are indicated 
according to the shortest, 898 -nucleotide minigenome. 

20 Nucleotide lengths of specific regions are given in 

parentheses. The sequence of positions 90 to 124 (3' to 5", 
negative -sense) is shown at the bottom; PIV3-specif ic sequence 
is underlined, the complement of a vaccinia virus 
transcriptional terminator is overlined, the complement of the 

25 Xbal site is italicized, and the complement of the 

translational start codon for the CAT ORF is in bold. The 
site where 0 to 6 G residues were inserted, resulting in 
PIV3-CAT 0 to +6, is indicated. Abbreviations: GS, 
transcriptional gene-start motif; GE, transcriptional gene-end 

30 termination/polyadenylation motif; NT, nontranslated gene 
sequence . 

Figure 4 depicts expression of CAT enzyme by 
minigenomes PIV3-CAT 0 to +6. Cells were transfected with 
plasmids encoding the indicated PIV3-CAT minigenome and the N, 
35 P and L proteins, and were infected simultaneously with 
vaccinia virus recombinant vTF7-3 expressing T7 RNA 
polymerase. Lysates were prepared 48 h post -infection, and 
samples representing 3000 cells were analyzed for CAT 
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activity. Activities are expressed relative to the +2 
minigenome, which is a multiple of six, as 100. Samples in 
which the L plasmid was omitted did not have CAT activity 
detectable under these conditions. 
5 Figures 5A and 5B depict synthesis of 

mini -ant igenomic RNA by PIV3-CAT minigenomes 0 to +6 . Cells 
were transfected and infected as described for Fig. A, except 
for controls in which N plasmid (lane 1) or L plasmid (lane 2) 
was omitted. The PIV3-CAT minigenome plasmid which was 

10 transfected in each case is indicated. Cell lysates were 

prepared at 48 h post-transf ection: one aliquot was treated 
with micrococcal nuclease (treated) and the second was 
mock-treated (untreated) , followed by processing for RNA 
purification. The RNAs were analyzed by Northern blot 

15 hybridization with a negative- sense CAT riboprobe. As a 
marker and control for nuclease treatment, the RSV-CAT C2 
minigenome (Grosfeld et al., J. Virol. 69:5677-5686 (1995), 
incorporated herein by reference in its entirety) was 
complemented by the RSV N, P and L proteins and was processed 

20 in parallel (Lane 10) . Autoradiograms of the hybridized blots 
are shown in Fig. 5A. Fig. 5B shows phosphoriraager analysis 
in which the amount of hybridization relative to the +2 
minigenome was calculated separately for the treated and 
untreated samples. 

25 Figures 6A and 6B depict synthesis of polyadenylated 

mRNA by PIV3-CAT minigenomes 0 to +6 . Using aliquots of cells 
from the same experiment as shown in Figs. 5A and 5B, total 
intracellular RNA was harvested at 48 h post-transf ection and 
fractionated by oligo(dT) chromatography into bound and 

30 unbound fractions, which were analyzed by Northern blot 

hybridization with a negative-sense CAT riboprobe. Fig. 6A 
shows autoradiograms of the hybridized blots. Fig. 6B shows 
the phosphorimager analysis in which the amount of 
hybridization was normalized relative to the bound RNA 

35 fraction of the +2 minigenome, this was calculated separately 
for the bound and unbound fractions. 

Figures 7A and 7B depict accumulation of 
intracellular minigenome in response to plasmid encoding 
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PIV3-CAT minigenomes 0 to +6. This is a continuation of the 
experiment shovm in Figs. 5A and 5B. Thus, cells which had 
been transfected with plasmids encoding a minigenome and the 
N, P and L proteins and infected with vTF7-3 were harvested 48 
5 h post-transf ection and cell lysates were prepared. One 
aliquot of lysate was treated with micrococcal nuclease 
(treated) and the other was mock-treated (untreated) , followed 
by processing for RNA purification. The RNAs were analyzed by 
Northern blot hybridization with a positive-sense CAT 

10 riboprobe, whereas in Fig. 5A the riboprobe was 

negative-sense. Fig. 7A shows autoradiograms of the 
hybridized blots. Fig. 7B shows phosphor imager analysis in 
which the amount of hybridization relative to the +2 
minigenome was calculated separately for the treated and 

15 untreated samples. 

Figures 8-10 illustrate construction of p3/7(131)2G. 
p3/7(131)2G encodes a complete positive-sense HPIV3 antigenome 
and contains two G residues between the T7 promoter and 
nucleotide 1 of the antigenome. The construction involved 

20 modification separately of the left and right halves of cDNA 
218(131) encoding the negative- sense genome. Using the same 
strategy, a second cDNA, p3/7(131) was constructed which is 
identical to p3/7(131)2G except that the two G residues were 
omitted. 

25 Figure 8 depicts construction of p(Left+2G), 

performed by replacing the ribozyme and T7 transcription 
terminator of p218 (A*A' CC D*E) with a T7 promoter including 
two G residues. p218 (A*A'CC'D*E) was used as the template in 
a PGR that amplified the left-hand 1224 nucleotides of the 

30 HPIV3 genome (black rectangle labeled "PIV3seq.") using a left 
hand PGR primer that introduced the T7 promoter. This PGR 
fragment was cloned into the Mlul-Pmll window of 
p218(A*A'CC'D*E) , resulting in p(Left+2G) . 

Figure 9 depicts construction of p(Right+), 

35 performed by replacing the T7 promoter of p(E*FF'GHIHKL*) with 
a ribozyme and T7 terminator placed adjacent to PIV3 
nucleotide 15642 in the positive-sense (antigenome) strand. 
p(E*FF'GHIHKL*) was used as a template in a PGR that amplified 
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the right-hand 64 9 nucleotides of the HPIV3 genome (black 
rectangle labeled "PIV3 seq.") using a mutagenic 
oligonucleotide that added part of the delta ribozyme 
(including a naturally-occurring RsrII site) . This PGR 
5 product was cloned into the RsrII -BssHII window of p3/7 to 
yield pPIV3-3/7, thus reconstructing a complete ribozyme 
flanked by the T7 terminator. The Swal-NgoMI fragment of 
pPIV3-3/7 was cloned into the Swal-NgoMI windows of 
p(E*FF'GHIHKL*) , resulting in p(Right+). The locations of the 

10 seven sequence markers are indicated with asterisks. 

Figure 10 depicts construction of p3/7(131)2G. The 
NgoMI-XhoI fragment of p(Right+) was cloned into the NgoMI- 
Xhol window of p(Left+2G), resulting in p3/7(131)2G. The 
positions of HPIV3 genes are indicated (not to scale) . The 

15 locations of sequence markers are indicated with asterisks. 

Figure 11 shows sequence confirmation of a negative 
sense recombinant PIV3 . A 1379 bp fragment {nucleotides 7334- 
8713) spanning the mutations at 7903, 7913, and 7915 was 
generated by RT-PCR of RNA from infected cells and then 

20 analyzed by cycle -sequencing. The mutations differentiating 
recombinant PIV from JS wt are indicated by arrows. The 
complete sequence from nt 7897 to 7922 is shown in the margin 
next to each gel, with the three nucleotide differences 
indicated in bold. 

25 Figure 12 provides a map of plasmid 

pTM(L) 942/992/1558, which contains the PIV3 L cDNA with amino 
acid substitutions at positions 942, 992, and 1558 in the L 
protein sequence. The relative position of each of coding 
change is indicated, together with the aa difference and the 

30 naturally-occurring restriction site which was deliberately 

ablated as a marker. Restriction sites used for cloning (Sphl, 
PinAI, BamHl and Whel) are indicated. The arrow shows the 
direction of the L protein coding sequence and is numbered 
according to amino acid position. 

35 Figure 13 is a schematic representation of 

recombinant PIV3 viruses bearing representative mutations 
within the invention. 
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Figure 14 illustrates construction of cDNA encoding 
the chimeric PIV3-PIV1 antigenome in which the PIV3 HN and F 
ORFs were replaced by those of PIVl. First (starting from the 
bottom left) the PIV3 F and HN genes were subcloned (as BspE-I 
5 1-Xho I and EcoR I-Spe I fragments) from the full-length PIV3 
cDNA clone p3/7(131)2G (Panel A) to generate pLit.PIV3.F3a and 
pLit.PIV3.HN4 (Panel B) , respectively. PGR mutagenesis (Panel 
C) was performed to delete the complete coding regions of the 
PIV3 F and HN genes and to introduce new restriction sites 

10 (boxed) . PIVl F and HN cDNAs were prepared from infected-cell 
RNA by RT-PCR and cloned into pLITMUS28 (Panel D) . These 
clones, pLit.lFwt and pLit.lHNwt, were used as templates for 
modification of the PIVl F and HN genes at their start and 
stop codons to introduce new restriction sites compatible with 

15 those introduced in the PIV3 deletions (Panel E) . Chimeric F 
and HN genes were constructed by importing the PIVl coding 
regions into the PIV3 deletions to generate pLit.PIV3-l.Fhc 
and pLit.PIV3-l.HNhc (Panel F) . The chimeric F and HN were 
assembled together to generate pSE.PIV3-l.hc (Panel G) . The F 

20 and HN chimeric segment was introduced into full-length PIV3 
clone p3/7(131)2G by replacing its BspE 1-Sph I fragment to 
generate pFLC.2G+.hc (Panel H) . The small boxes between genes 
represent the gene end, intergenic, and gene start regions, 
and the lines represent the non-coding regions. Shaded 

25 portions are from PIVl, open boxes are from PIV3 . The black 
arrows depict the T7 promoter, while black boxes depict the 
hepatitis delta virus (HDV) ribozyme. 

Figure 15A is a diagram of the chimeric HN and F 
genes of the chimeric rPIV3-l (middle) in parallel with that 

30 of rPIV3/JS (top; alternatively referred to herein as rJS and 
JS wt rPIV3) and PIVl/Wash64 (bottom). The four junction 
regions containing the sequence transitions from PIV3 to PIVl 
are boxed and numbered I to IV. Each small box between genes 
represents the gene end, intergenic, and gene start region, 

35 and the lines represent the non-coding regions. RT-PCR 

primers, specific to PIV3 M and L genes (primer pair A at top) 
or specific to PIVl M and HN genes (primer pair B at bottom) 
used in Figure 14B, are depicted as arrows. 
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Figure 15B shows an evaluation of RT-PCR products 
generated using virion RNA (vRNA) as template and the PIV3- or 
PIVl-specif ic primer pairs described in Figure 14A. The 
reverse transcription step was omitted in a duplicate of each 
5 template, as indicated, to confirm that the PGR product was 

derived from RNA. For rPIV3-l, the PIV3 -specific primer pair 
A gave rise to the expected 4.6 kb product, whereas primer 
pair B, specific to PIV-1 sequences not present in rPIV3-l, 
did not yield product. Positive controls using rPIV3/JS 

10 (labeled as rPIV3) or PIVl/Wash64 (labeled as PIVl) vRNA are 
shown in parallel. 

Figure 16 represents sequences of PIV3-PIV1 
junctions in the RT-PCR products of rPIV3"l shown in Figure 
14B were determined. The sequence for each of the four 

15 junction regions (Regions I-IV) is presented and aligned with 
the corresponding regions of rPIV3/JS (top line) and 
PIVl/Wash64 (bottom line) , which were sequenced in parallel 
from RT-PCR products. Vertical bars indicate sequence 
identity, and the boxed regions indicate introduced mutations 

20 and restriction sites. The Gin to Glu codon change in the 
chimeric F gene is indicated by shaded box. Start and stop 
codons are underlined. 

Figure 16 shows sequencing gels for region III 
(left) and IV (right) of RT-PCR products of rPIV3-l compared 

25 with rPIV3 (left) or PIVl (right) . Start and stop codons are 
marked by a box, restriction sites are marked by arrows. 

Figure 18 depicts multicycle growth of parental and 
chimeric PIVs in tissue culture. LLC-MK2 cell monolayers were 
inoculated with virus at an MOI of 0.01, and virus- infected 

30 cells were incubated at 32®C in the presence of trypsin. 
Tissue culture supernatant s were harvested at 24 hour 
intervals, frozen, and analyzed in the same TCID50 assay using 
hemadsorption to identify virus-infected cultures. Each point 
represents the mean titer of three separate cultures, with 

35 S.E. Indicated. The dotted horizontal line indicates the 
lower limit of viral detection. 

Figure 19 illustrates introduction of the three L 
gene mutations of cp45 into pFLC.2G+.hc, the antigenomic cDNA 
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clone of the chimeric virus rPIV3-l. pTM{L) 942/992/1558 (A) 
is a plasmid clone of the L gene that carries the three 
mutations found in cp45. The mutation at amino acid position 
942 in the PIV3 L protein is a tyr (wt) to his (cp45) 
5 substitution and a nearby Eae I site was ablated to mark this 
site. Similarly, the 992 mutation is a leu to phe change with 
a Bsr I site ablated and the 1558 mutation is a thr to ile 
change with an Ava II site ablated. The 2.9 kb Sphl-Nhel 
fragment present in pTM(L) 942/992/1558 was introduced into 

10 pFLC.2G+hc (B) , the plasmid carrying the full length cDNA 
clone of rPIV3-l, to give pFLC+hc . cp45L (C) . For the 
constructs in (B) and (C) , the black boxes indicate the 
location of the hepatitis D virus ribozyme and the T7 
teminator, the shaded regions are the PI VI HN and F ORFs, and 

15 the open boxes represent sequences derived from PIV3 . 

pFLC. 2G+ . he . cp45L was used in the transfection to yield the 
attenuated chimeric recombinant virus designated rPIV3- 
1 . cp45L. 

20 Description of the Specific Embodiments 

The present invention provides compositions and 
methods for producing and modifying infectious PIV from 
isolated polynucleotide molecules, preferably cDNA. 
Infectious PIV particles are produced by a recombinant 

25 coexpression system that permits introduction of defined 

changes into infectious PIV. These modifications are useful 
in a wide variety of applications, including the development 
of live attenuated vaccine strains bearing predetermined, 
defined attenuating mutations. 

30 Infectious PIV of the invention are produced by 

intracellular or cell- free coexpression of one or more 
isolated polynucleotide molecules that encode a PIV genome or 
antigenome RNA, together with one or more polynucleotides 
encoding viral proteins necessary to generate a transcribing, 

35 replicating nucleocapsid . Among the viral proteins useful for 
coexpression to yield infectious PIV are the major 
nucleocapsid protein (N) protein, nucleocapsid phosphoprotein 
(P) , large (L) polymerase protein, fusion protein (F) , 
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hemagglutinin -neuraminidase glycoprotein (HN) , and matrix (M) 
protein. Also useful in this context are products of the C, D 
and V ORFs of PIV. 

cDNAs encoding a PIV genome or antigenome are 
5 constructed for intracellular or in vitro coexpression with 

the necessary viral proteins to form infectious PIV. By "PIV 
antigenome" is meant an isolated positive- sense polynucleotide 
molecule which serves as a template for synthesis of progeny 
PIV genome. Preferably a cDNA is constructed which is a 

10 positive-sense version of the PIV genome corresponding to the 
replicative intermediate RNA, or antigenome, so as to minimize 
the possibility of hybridizing with positive- sense transcripts 
of complementing sequences encoding proteins necessary to 
generate a transcribing, replicating nucleocapsid. 

15 In some embodiments of the invention the genome or 

antigenome of a recombinant PIV (rPIV) need only contain those 
genes or portions thereof necessary to render the viral or 
subviral particles encoded thereby infectious. Further, the 
genes or portions thereof may be provided by more than one 

20 polynucleotide molecule, i.e., a gene may be provided by 
complementation or the like from a separate nucleotide 
molecule. In other embodiments, the PIV genome or antigenome 
encodes all functions necessary for viral growth, replication, 
and infection without the participation of a helper virus or 

25 viral function provided by a plasmid or helper cell line. 

By "recombinant PIV" is meant a PIV or PIV- like 
viral or subviral particle derived directly or indirectly from 
a recombinant expression system or propagated from virus or 
subviral particles produced therefrom. The recombinant 

30 expression system will employ a recombinant expression vector 
which comprises an operably linked transcriptional unit 
comprising an assembly of at least a genetic element or 
elements having a regulatory role in PIV gene expression, for 
example, a promoter, a structural or coding sequence which is 

35 transcribed into PIV RNA, and appropriate transcription 
initiation and termination sequences. 

To produce infectious PIV from a cDNA- expressed PIV 
genome or antigenome, the genome or antigenome is coexpressed 
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with those PIV N, P and L proteins necessary to (i) produce a 
nucleocapsid capable of RNA replication, and (ii) render 
progeny nucleocapsids competent for both RNA replication and 
transcription. Transcription by the genome nucleocapsid 
5 provides the other PIV proteins and initiates a productive 

infection. Alternatively, additional PIV proteins needed for 
a productive infection can be supplied by coexpression. 

Synthesis of PIV antigenome or genome together with 
the above-mentioned viral proteins can also be achieved in 

10 vitro (cell-free), e.g., using a combined transcription- 
translation reaction, followed by transfection into cells. 
Alternatively, antigenome or genome RNA can be synthesized in 
vitro and transfected into cells expressing PIV proteins. 
In certain embodiments of the invention, 

15 complementing sequences encoding proteins necessary to 

generate a transcribing, replicating PIV nucleocapsid are 
provided by one or more helper viruses. Such helper viruses 
can be wild type or mutant. Preferably, the helper virus can 
be distinguished phenotypically from the virus encoded by the 

20 PIV cDNA. For example, it is desirable to provide monoclonal 
antibodies which react immunologically with the helper virus 
but not the virus encoded by the PIV cDNA. Such antibodies 
can be neutralizing antibodies. In some embodiments, the 
antibodies can be used in affinity chromatography to separate 

25 the helper virus from the recombinant virus. To aid the 

procurement of such antibodies, mutations can be introduced 
into the PIV cDNA to provide antigenic diversity from the 
helper virus, such as in the HN or F glycoprotein genes. 

In alternate embodiments of the invention, the N, P, 

30 L and other desired PIV proteins are encoded by one or more 
non- viral expression vectors, which can be the same or 
separate from that which encodes the genome or antigenome. 
Additional proteins may be included as desired, each encoded 
by its own vector or by a vector encoding one or more of the 

35 N, P, L and other desired PIV proteins, or the complete genome 
or antigenome. Expression of the genome or antigenome and 
proteins from transfected plasmids can be achieved, for 
example, by each cDNA being under the control of a promoter 
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for T7 RNA polymerase, which in turn is supplied by infection, 
transfection or transduction with an expression system for the 
T7 RNA polymerase, e.g., a vaccinia virus MVA strain 
recombinant which expresses the T7 RNA polymerase (Wyatt et 
5 al.. Virology, 210: 202-205 (1995), incorporated herein by 

reference in its entirety) . The viral proteins, and/or T7 RNA 
polymerase, can also be provided by transformed mammalian 
cells or by transfection of preformed mRNA or protein. 



10 present invention by, e.g., assembling cloned cDNA segments, 
representing in aggregate the complete antigenome, by 
polymerase chain reaction or the like (PGR; described in, 
e.g., U.S. Patent Nos. 4,683,195 and 4,683,202, and PCR 
Protocols : A Guide to Methods and Applications, Innis et al., 

15 eds.. Academic Press, San Diego (1990), each incorporated 

herein by reference in its entirety) of reverse-transcribed 
copies of PIV mRNA or genome RNA. For example, a first 
construct is generated which comprises cDNAs containing the 
left hand end of the antigenome, spanning from an appropriate 

20 promoter (e.g., T7 RNA polymerase promoter) and assembled in 
an appropriate expression vector, such as a plasmid, cosmid, 
phage, or DNA virus vector. The vector may be modified by 
mutagenesis and/or insertion of synthetic polylinker 
containing unique restriction sites designed to facilitate 

25 assembly. For ease of preparation the N, P, L and other 

desired PIV proteins can be assembled in one or more separate 
vectors. The right hand end of the antigenome plasmid may 
contain additional sequences as desired, such as a flanking 
ribozyme and tandem T7 transcriptional terminators. The 

30 ribozyme can be hammerhead type (e.g., Grosfeld et al,, 

(1995), supra), which would yield a 3' end containing a single 
nonviral nucleotide, or can be any of the other suitable 
ribozymes such as that of hepatitis delta virus (Perrotta et 
al.. Nature 350:434-436 (1991), incorporated herein by 

35 reference in its entirety) which would yield a 3' end free of 
non-PIV nucleotides. The left- and righthand ends are then 
joined via a common restriction site. 



A PIV antigenome may be constructed for use in the 
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A variety of nucleotide insertions, deletions and 
rearrangements can be made in the PIV genome or antigenome 
during or after construction of the cDNA. For example, 
specific desired nucleotide sequences can be synthesized and 
5 inserted at appropriate regions in the cDNA using convenient 
restriction enzyme sites. Alternatively, such techniques as 
site-specific mutagenesis, alanine scanning, PGR mutagenesis, 
or other such techniques well known in the art can be used to 
introduce mutations into the cDNA. 

10 Alternative means to construct cDNA encoding the 

genome or antigenome include reverse transcription-PCR using 
improved PGR conditions (e.g., as described in Cheng et al., 
Proc. Natl. Acad. Sci . USA 91:5695-5699 (1994)), incorporated 
herein by reference) to reduce the number of subunit cDNA 

15 components to as few as one or two pieces. In other 

embodiments different promoters can be used (e.g., T3, SP6) or 
different ribozymes (e.g., that of hepatitis delta virus. 
Different DNA vectors (e.g., cosmids) can be used for 
propagation to better accommodate the larger size genome or 

20 antigenome. 

Isolated polynucleotides (e.g., cDNA) encoding the 
genome or antigenome may be inserted into appropriate host 
cells by transf ection, electroporation, mechanical insertion, 
transduction or the like, into cells which are capable of 

25 supporting a productive PIV infection, e.g., HEp-2, FRhL-DBS2, 
LLC-MK2, MRC-5, and Vero cells. Transf ection of isolated 
polynucleotide sequences may be introduced into cultured cells 
by, for example, calcium phosphate -mediated transf ection 
(Wigler et al . , Cell 14: 725 (1978); Corsaro and Pearson, 

30 Somatic Cell Genetics 7: 603 (1981); Graham and Van der Eb, 
Virology 52: 456 (1973)), electroporation (Neumann et al., 
EMBQ J . 1: 841-845 (1982)), DEAE-dextran mediated transf ection 
(Ausubel et al., (ed.) Current Protocols in Molecular Bioloav . 
John Wiley and Sons, Inc., NY (1987), cationic lipid-mediated 

35 transf ection (Hawley-Nelson et al . , Focus 15: 73-79 (1993)) or 
a commercially available transfection regent, e.g., 
LipofectACE® (Life Technologies, Gaithersburg, MD) or the like 
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(each of the foregoing references are incorporated herein by 
reference in its entirety) . 

As noted above, in some embodiments of the invention 
the N, P, L and other desired PIV proteins are encoded by one 
5 or more helper viruses which is phenotypically distinguishable 
from that which encodes the genome or antigenome. The N, P, L 
and other desired PIV proteins can also be encoded by one or 
more expression vectors which can be the same or separate from 
that which encodes the genome or antigenome, and various 

10 combinations thereof. Additional proteins may be included as 
desired, encoded by its own vector or by a vector encoding one 
or more of the N, P, L and other desired PIV proteins, or the 
complete genome or antigenome. 

The compositions and methods of the invention permit 

15 analysis and manipulation of PIV molecular biology and 

pathogenic mechanisms using, e.g., defined mutations to alter 
the function or expression of selected PIV proteins. Using 
these methods and compositions, one can readily distinguish 
mutations responsible for desired phenotypic changes from 

20 silent incidental mutations, and select phenotype- specif ic 

mutations for incorporation into a recombinant PIV genome or 
antigenome for vaccine production. 

Modifications of PIV provided within the invention 
are directed toward the production of improved vaccine 

25 viruses, e.g., to enhance viral attenuation and vaccine 

immunogenicity, to ablate epitopes associated with undesirable 
immunopathology, to accommodate antigenic drift, etc. To 
achieve these and other objectives, the compositions and 
methods of the invention allow for a wide variety of 

30 modifications to be introduced into a PIV genome or antigenome 
for incorporation into infectious, recombinant PIV. For 
example, foreign genes or gene segments encoding protective 
antigens or epitopes may be added within a PIV clone to 
generate PIV virus strains capable of inducing immunity to 

35 both PIV and another virus or agent from which the protective 
antigen was derived. Alternatively, foreign genes may be 
inserted, in whole or in part, encoding modulators of the 
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immune system, such as cytokines, to enhance immunogenicity of 
a vaccine virus . 

Other mutations which may be included within PIV 
clones of the invention include, for example, substitution of 



encoding a cytoplasmic tail of a glycoprotein gene) with a 
counterpart gene or gene segment in a PIV clone. 
Alternatively, the relative order of genes within a PIV clone 
can be changed, the PIV genome promoter can be replaced with 

10 its antigenome counterpart, or selected gene(s) rendered non- 
functional (e.g., by functional ablation involving 
introduction of a stop codon to prevent expression of the 
gene) . Other modifications in a PIV clone can be made to 
facilitate manipulations, such as the insertion of unique 

15 restriction sites in various non-coding or coding regions or 
elsewhere. In addition, nontranslated gene sequences can be 
removed to increase capacity for inserting foreign sequences. 

Thus, by providing infectious clones of PIV the 
invention permits a wide range of alterations to be 

20 recombinant ly produced within the PIV genome (or antigenome) , 
yielding defined mutations which specify desired phenotypic 
changes. By "infectious clone" is meant cDNA or its product, 
synthetic or otherwise, RNA capable of being directly 
incorporated into infectious virions which can be transcribed 

25 into genomic or antigenomic RNA capable of serving as a 

template to produce the genome of infectious viral or subviral 
particles. As noted above, defined mutations can be 
introduced by a variety of conventional techniques (e.g., 
site-directed mutagenesis) into a cDNA copy of the genome or 

30 antigenome. The use of genomic or antigenomic cDNA 

subfragments to assemble a complete genome or antigenome cDNA 
as described herein has the advantage that each region can be 
manipulated separately, where small cDNA subjects provide for 
better ease of manipulation than large cDNA subjects, and then 

35 readily assembled into a complete cDNA. Thus, the complete 

antigenome or genome cDNA, or a selected subfragment thereof, 
can be used as a template for oligonucleotide-directed 
mutagenesis. This can be through the intermediate of a 



5 



heterologous genes or gene segments (e.g 



a gene segment 



wo 98/53078 



PCT/US98/10S51 



25 

single • Stranded phagemid form, such as using the MUTA-gen® kit 
of Bio-Rad Laboratories (Richmond, CA) , or a method using the 
double -stranded plasmid directly as a template such as the 
Chameleon® mutagenesis kit of Strategene (La Jolla, CA) , or by 
5 the polymerase chain reaction employing either an 

oligonucleotide primer or a template which contains the 
mutation (s) of interest. A mutated subfragment can then be 
assembled into the complete antigenome or genome cDNA. A 
variety of other mutagenesis techniques are knovm and can be 

10 routinely adapted for use in producing the mutations of 

interest in a PIV antigenome or genome cDNA of the invention. 

Thus, in one illustrative embodiment mutations are 
introduced by using the MUTA-gene® phagemid in vitro 
mutagenesis kit available from Bio-Rad Laboratories. In 

15 brief, cDNA encoding an PIV genome or antigenome is cloned 
into the plasmid pTZlBU, and used to transform CJ236 cells 
(Life Technologies) . Phagemid preparations are prepared as 
recommended by the manufacturer. Oligonucleotides are 
designed for mutagenesis by introduction of an altered 

20 nucleotide at the desired position of the genome or 

antigenome. The plasmid containing the genetically altered 
genome or antigenome is then amplified. 

Mutations can vary from single nucleotide changes to 
the introduction, deletion or replacement of large cDNA 

25 segments containing one or more genes or genome segments. 

Genome segments can correspond to structural and/or functional 
domains, e.g., cytoplasmic, transmembrane or ectodomains of 
proteins, active sites such as sites that mediate binding or 
other biochemical interactions with different proteins, 

30 epitopic sites, e.g., sites that stimulate antibody binding 

and/or humoral or cell mediated immune responses, etc. Useful 
genome segments in this regard range from about 15-35 
nucleotides in the case of genome segments encoding small 
functional domains of proteins, e.g., epitopic sites, to about 

35 50, 75, 100, 200-500, and 500-1,500 or more nucleotides. 

The ability to introduce defined mutations into 
infectious PIV has many applications, including the 
manipulation of PIV pathogenic and immunogenic mechanisms. 



wo 98/53078 



PCTAJS98/10551 



26 

For example, the functions of PIV proteins, including the N, 
P, M, F, HN, and L proteins and C, D and V ORF products, can 
be manipulated by introducing mutations which ablate or reduce 
the level of protein expression, or which yield mutant 
5 protein. In one such exemplary modification, a sequence at 
the cleavage site of the F protein can be modified and the 
effects of this modification on growth in tissue culture and 
infection and pathogenesis of the resultant PIV can be 
routinely determined in experimental animals, 

10 Various genome RNA structural features, such as 

promoters, intergenic regions, and transcription signals, can 
also be routinely manipulated within the methods and 
compositions of the invention. The effects of trans-acting 
proteins and cis -acting RNA sequences can be readily 

15 determined, for example, using a complete antigenome cDNA in 
parallel assays employing PIV minigenomes (Dimock, et al., J. 
Virol . 67 : 2772-8 (1993) , incorporated herein by reference in 
its entirety) , whose rescue -dependent status is useful in 
characterizing those mutants that may be too inhibitory to be 

20 recovered in replication- independent infectious virus. 

The present invention further provides tools and 
methods to readily distinguish between silent incidental 
mutations and mutations responsible for phenotype differences, 
for example by introducing suspect mutations, separately and 

25 in various combinations, into the genome or antigenome of 
infectious wild-type (i.e., for one or more phenotypic 
character such as temperature sensitivity, replication in a 
selected host, etc.) PIV. This process permits identification 
of mutations responsible for desired vaccine phenotypes such 

30 as attenuation, temperature sensitivity, cold-adaptation, 
small plaque size, host range restriction, etc. Mutations 
identified by these methods can then be introduced in various 
combinations to modify a vaccine virus to an appropriate level 
of attenuation, etc., as desired. Moreover, the present 

35 invention provides the ability to combine mutations from 
different strains of virus into a single vaccine strain. 

As noted above, mutations incorporated within 
recombinantly altered PIV clones may be selected based on 



wo 98/53078 



PCT/US98/10551 



27 

their ability to alter expression and/or function of a 
selected PIV protein, yielding a desired phenotypic change, or 
for a variety of other purposes. Desired phenotypic changes 
include, e.g., changes in viral growth in culture, temperature 
5 sensitivity, plaque size, attenuation, and immunogenic ity. 

For example, a polynucleotide sequence encoding the genome or 
antigenome can be modified by a nucleotide insertion, 
rearrangement, deletion or substitution to specify 
attenuation, temperature-sensitivity, cold- adaptation, small 

10 plaque size, host range restriction, alteration in gene 
expression, or a change in an immunogenic epitope. 

In one aspect of the invention, mutations occurring 
in biologically derived, attenuated PIV are identified and 
introduced individually or in combination into a full-length 

15 PIV clone, and the phenotypes of rescued recombinant viruses 
containing the introduced mutations are determined. In 
exemplary embodiments, amino acid changes displayed by 
biologically derived mutant viruses over a wild- type PIV, for 
example changes exhibited by PIV mutants having ts, ca or att 

20 phenotypes, are incorporated within recombinant PIV clones. 
These changes from biologically derived mutant PIV specify 
desired characteristics in the resultant clones, e.g., an 
attenuation phenotype specified by a mutation adopted from the 
HPIV3 mutant JS cp45. These changes are preferably introduced 

25 into recombinant virus using two or three nucleotide changes 
compared to a corresponding wild type or biologically derived 
mutant sequence, which has the effect of stabilizing the 
mutation against genetic reversion. 

The present invention also provides recombinant PIV 

30 having multiple, phenotype -specifying mutations introduced in 
selected combinations into the genome or antigenome of an 
infectious clone. This process, coupled with evaluation of 
phenotype, provides mutant recombinant PIV having such desired 
characteristics as attenuation, temperature sensitivity, 

35 cold- adaptation, small plaque size, host range restriction, 
etc. Thus, exemplary PIV clones are disclosed herein which 
incorporate one or more, and preferably at least two 
attenuating mutations, e.g., ts, ca or att mutations adopted 
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from a biologically derived PIV mutant, such as JS cp45. 
Target genes for adopting biologically derived mutations in a 
recombinant PIV in this context include the nucleocapsid 
protein N, phosphoprotein P, large polymerase subunit L, 
5 matrix protein M, hemagglutinin-neuraminidase protein HN, 
fusion protein F and the C, D and V ORF products. Also 
targeted are extragenic sequences, eg,, sequences in the 3' 
leader or trailer regions of a PIV genome, and in cis-acting 
elements such as gene start and gene end sequences, eg., the N 

10 gene start signal. Exemplary mutations incorporated in 
recombinant PIV herein include one or more nucleotide 
substitutions specifying amino acid change (s) in the 
polymerase L gene, e.g., at Tyr942, LeUjgj/ and/or Thrisgg. For 
example, PIV recombinants are disclosed wherein Tyrj^j is 

15 replaced by His, Leu^^j is replaced by Phe, and/or Thriggg is 
replaced by lie. These mutations have been successfully 
incorporated in various exemplary PIV recombinants herein, 
including r942, r992, rl558, r942/992, r992/l558, r942/1558, 
or r942/992/1558 recombinants described in the Examples below. 

20 Other exemplary mutations adopted from a biologically derived 
PIV mutant include one or more mutations in the N protein, 
including specific mutations at a position corresponding to 
residues Valgg or Serjej of JS cp45. In more detailed aspects, 
these mutations are represented as Valjg to Ala or Serjg^ to 

25 Ala. Also disclosed in the Examples below are recombinant PIV 
which encode an amino acid substitution in the C protein, eg., 
a mutation at a position corresponding to Ilegg of JS cp45, 
preferably represented by a substitution of Ile^g to Thr. 
Further exemplary mutations adopted from biologically derived 

30 PIV mutants include one or more mutations in the F protein, 
including mutations adopted from JS cp45 at a position 
corresponding to residues 116420 or Ala45o of JS cp45, 
preferably represented by acid substitutions 116420 to Val or 
Ala45o to Thr. Other PIV recombinants within the invention 

35 adopt one or more amino acid substitutions in the HN protein, 
as exemplified hereinbelow by a recombinant PIV adopting a 
mutation at a position corresponding to residue Val384 of JS 
cp45, preferably represented by the substitution Val304 to Ala. 
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Yet additional examples within this aspect of the invention 
include recombinant PIV which incorporate one or more 
mutations in an extragenic sequence, eg., a 3' leader sequence 
of recombinant PIV genome or antigenome. Exemplary mutations 
5 in this context include mutations in the 3' leader occurs at 
one or more positions corresponding to nucleotide 23, 24, 
and/or 28 of JS cp4 5, for example a T to C change at 
nucleotide 23, a C to T change at nucleotide 24, or a G to T 
change at nucleotide 28. Yet additional extragenic mutations 

10 include one or more mutations in a N gene start sequence, as 
exemplified hereinbelow by a mutation in the N gene start 
sequence at a position corresponding to nucleotide 62 of JS 
cp45, preferably represented by a A to T change. The above 
exemplary mutations adopted from biologically derived mutant 

15 PIV are evaluated and combined into recombinant PIV in the 

Examples below to result, individually and/or in combination, 
in novel, attenuated candidate vaccine strains, as exemplified 
by the recombinants designated herein as rcp45, rcp45 
3'NCMFHN, rcp45 3'NL, rcp45 3'N, and rcp45 F. Other PIV 

20 recombinants within the invention will incorporate a plurality 
and up to a full complement of the mutations present in one or 
more of these exemplary recombinants, as well as mutations 
identified in other biologically derived mutant PIV strains 
identified and adopted in a recombinant PIV according to the 

25 teachings herein. 

Mutations identified according to the methods 
disclosed herein are compiled into a "menu" and introduced in 
various combinations to calibrate a vaccine virus to a 
selected level of attenuation, immunogenicity and stability. 

30 In preferred embodiments, the invention provides for 
supplementation of one or more mutations adopted from 
biologically derived PIV, e.g., ts, ca or att mutations, with 
additional types of mutations involving the same or different 
genes. Target genes for mutation in this context also include 

35 the nucleocapsid protein N, phosphoprotein P, large polymerase 
subunit L, matrix protein M, hemagglutinin-neuraminidase 
protein HN, fusion protein F and the C, D and V ORF products. 
In one aspect, recombinant PIVs are provided wherein at least 
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one attenuating mutation occurs in the PIV polymerase gene L 
and involves a nucleotide substitution specifying a ts or att 
phenotype adopted from a biologically derived mutant PIV 
strain, for example JS cp45. Exemplary HPIV3 recombinants 
5 disclosed herein include the r942, r992, rl558, r942/992, 

r992/l558, r942/1558, or r942/992/1558 recombinants described 
in the Examples below. These exemplary PIV clones incorporate 
one or more nucleotide substitutions resulting in an amino 
acid change in the polymerase gene, e.g., at Tyr942, Leu^gj/ 

10 and/or Thr^sse. For example, PIV recombinants are disclosed 
wherein Tyr942 is replaced by His, LeUjjj is replaced by Phe, 
and/or Thriggg is replaced by lie. Preferably, two or three 
mutations are incorporated in a codon specifying an 
attenuating mutation adding increased stability against 

15 phenotypic reversion. 

In additional aspects, a variety of other genetic 
alterations can be produced in a recombinant PIV genome or 
antigenome for incorporation into infectious recombinant PIV, 
alone or together with one or more attenuating mutations 

20 adopted from a biologically derived mutant PIV. Heterologous 
genes (e.g. from different PIV strains or non-PIV sources such 
as another virus, e.g., RSV or measles virus) may be inserted 
or substituted, in whole or in part, the order of genes 
changed, gene overlap removed, the PIV genome promoter 

25 replaced with its antigenome counterpart, and even entire, 
non-essential genes deleted. In one aspect, a selected PIV 
gene, for example the C, D, or V ORF, is functionally deleted 
to yield a recombinant PIV having novel phenotypic 
characteristics, for example enhanced growth in vitro and/or 

30 attenuation in vivo. An infectious PIV clone of the invention 
can also be engineered to enhance its immunogenicity and 
induce a level of protection greater than that provided by 
natural infection, or to ablate epitopes associated with 
undesirable immunopathologic reactions. Enhanced 

35 immunogenicity of the vaccines produced by the present 
invention addresses one of the greatest obstacles to 
controlling PIV, namely the incomplete nature of immunity 
induced by natural infection. In this context, additional 
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gene(s) or gene segment (s) may be inserted into or proximate 
to the PIV genome or antigenome which may be placed under the 
control of a common or independent set of transcription 
signals. Genes of interest include those encoding cytokines 
5 (e.g., IL-2 through IL-15, especially IL-6 and IL-12, 

etc.) and proteins rich in T helper cell epitopes. The 
additional protein can be expressed either as a separate 
protein or as a chimera engineered from a second copy of one 
of the PIV proteins, such as HN. This provides the ability to 

10 modify and improve the immune response against PIV both 
quantitatively and qualitatively. 

Other mutations useful within the invention involve 
replacement of the 3' end of genome with its counterpart from 
antigenome, which is associated with changes in RNA 

15 replication and transcription. In addition, intergenic 

regions can be shortened or lengthened or changed in sequence 
content. In yet additional aspects, PIV useful in a vaccine 
formulation can be conveniently modified to accommodate 
antigenic drift in circulating virus. Typically the 

20 modification will be in the HN and/or F proteins. For 

example, a selected antigenic form of an entire HN or F gene, 
or the segment (s) encoding particular immunogenic regions 
thereof, is incorporated into a PIV genome or antigenome cDNA 
by replacement of a counterpart region in the infectious 

25 clone, or by adding one or more copies of the gene such that 
several antigenic forms are represented in the resultant 
clone. Progeny virus produced from the modified PIV cDNA are 
then used in vaccination protocols against emerging strains. 

Other mutations for use in infectious PIV of the 

30 invention include mutations in cis-acting signals identified 
during mutational analysis of PIV minigenomes. For example, 
insertional and deletional analysis of leader and trailer and 
flanking sequences identify viral promoters and transcription 
signals and provide a series of mutations associated with 

35 varying degrees of reduction of RNA replication or 

transcription. Saturation mutagenesis (whereby each position 
in turn is modified to each of the nucleotide alternatives) of 
these cis-acting signals also identifies mutations which 
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reduce or increase RNA replication or transcription. Any of 
these mutations can be inserted into the complete antigenome 
or genome as described herein. 

Additional modifications in PIV clones can be made 
5 to facilitate manipulations, such as the insertion of unique 
restriction sites in various intergenic regions or elsewhere. 
Nontranslated gene sequences can also be removed to increase 
capacity for inserting foreign sequences. 

Certain substitutions, insertions, deletions or 

10 rearrangements of genes or gene segments within recombinant 
PIV of the invention (e.g., substitutions of a gene segment 
encoding a selected protein or protein region, for instance a 
cytoplasmic tail, transmembrane domain or ectodomain, an 
epitopic site or region, a binding site or region, an active 

15 site or region containing an active site, etc.) are made in 
structural or functional relation to an existing, 
"counterpart" gene or gene segment from the same or different 
PIV or other source. Such modifications yield novel 
recombinants having desired phenotypic changes compared to 

20 wild-type or parental PIV or other viral strains. For 

example, recombinants of this type may express a chimeric 
protein having a cytoplasmic tail and/or transmembrane domain 
of one PIV fused to an ectodomain of another PIV, Other 
exemplary recombinants of this type express duplicate protein 

25 regions, such as duplicate immunogenic regions. 

As used herein, "counterpart" genes, gene segments, 
proteins or protein regions, are typically from heterologous 
sources (e.g., from different PIV genes, or representing the 
same (i.e., homologous or allelic) gene or gene segment in 

30 different PIV types or strains) . Typical counterparts 

selected in this context share gross structural features, 
e.g., each counterpart may encode a comparable protein or 
protein structural domain, such as a cytoplasmic domain, 
transmembrane domain, ectodomain, binding site or region, 

35 epitopic site or region, etc. Counterpart domains and their 

encoding gene segments embrace an assemblage of species having 
a range of size and sequence variations defined by a common 
biological activity among the domain or gene segment variants. 
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For example, two selected protein domains encoded by 
counterpart gene segments within the invention share 
substantially the same qualitative activity, such as providing 
a membrane spanning function, a specific binding activity, an 
5 immunological recognition site, etc. More typically, a 

specific biological activity shared between counterparts, 
e.g., between selected protein segments or proteins, will be 
substantially similar quantitatively, i.e., they will not vary 
in respective quantitative activity levels by more than 30%, 
10 preferably by no more than 20%, more preferably by no more 
than 5-10%. 

Counterpart genes and gene segments, as well as 
other polynucleotides disclosed herein for producing 
recombinant PIV within the invention, preferably share 

15 substantial sequence identity with a selected polynucleotide 
"reference sequence," e.g., with another selected counterpart 
sequence. As used herein, a "reference sequence" is a defined 
sequence used as a basis for sequence comparison, for example, 
a segment of a full-length cDNA or gene, or a complete cDNA or 

20 gene sequence. Generally, a reference sequence is at least 20 
nucleotides in length, frequently at least 25 nucleotides in 
length, and often at least 50 nucleotides in length. Since 
two polynucleotides may each (1) comprise a sequence (i.e., a 
portion of the complete polynucleotide sequence) that is 

25 similar between the two polynucleotides, and (2) may further 
comprise a sequence that is divergent between the two 
polynucleotides, sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences 
of the two polynucleotides over a "comparison window" to 

30 identify and compare local regions of sequence similarity. A 
"comparison window", as used herein, refers to a conceptual 
segment of at least 20 contiguous nucleotide positions wherein 
a polynucleotide sequence may be compared to a reference 
sequence of at least 20 contiguous nucleotides and wherein the 

35 portion of the polynucleotide sequence in the comparison 

window may comprise additions or deletions (i.e., gaps) of 20 
percent or less as compared to the reference sequence (which 
does not comprise additions or deletions) for optimal 
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alignment of the two sequences. Optimal alignment of 
sequences for aligning a comparison window may be conducted by 
the local homology algorithm of Smith & Waterman, Adv . Appl . 
Math. 2:482 (1981), by the homology alignment algorithm of 
5 Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search 
for similarity method of Pearson & Lipman, Proc . Nat 1 . Acad . 
Sci . USA 85:2444 (1988) (each of which is incorporated by 
reference) , by computerized implementations of these 
algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
10 Genetics Software Package Release 7.0, Genetics Computer 

Group, 575 Science Dr., Madison, WI, incorporated herein by 
reference), or by inspection, and the best alignment (i.e., 
resulting in the highest percentage of sequence similarity 
over the comparison window) generated by the various methods 
15 is selected. The term "sequence identity" means that two 

polynucleotide sequences are identical (i.e., on a nucleotide- 
by-nucleotide basis) over the window of comparison. The term 
"percentage of sequence identity" is calculated by comparing 
two optimally aligned sequences over the window of comparison, 
20 determining the number of positions at which the identical 

nucleic acid base (e.g.. A, T, C, G, U, or I) occurs in both 
sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of 
positions in the window of comparison (i.e., the window size), 
25 and multiplying the result by 100 to yield the percentage of 
sequence identity. The terms "substantial identity" as used 
herein denotes a characteristic of a polynucleotide sequence, 
wherein the polynucleotide comprises a sequence that has at 
least 85 percent sequence identity, preferably at least 90 to 
30 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a 
comparison window of at least 20 nucleotide positions, 
frequently over a window of at least 25-50 nucleotides, 
wherein the percentage of sequence identity is calculated by 
35 comparing the reference sequence to the polynucleotide 

sequence which may include deletions or additions which total 
20 percent or less of the reference sequence over the window 
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of comparison. The reference sequence may be a subset of a 
larger sequence. 

In addition to these polynucleotide sequence 
relationships, proteins and protein regions encoded by 
5 recombinant PIV of the invention are also typically selected 
to have conservative relationships, i.e. to have substantial 
sequence identity or sequence similarity, with selected 
reference polypeptides. As applied to polypeptides, the term 
"sequence identity" means peptides share identical amino acids 

10 at corresponding positions. The term "sequence similarity" 
means peptides have identical or similar amino acids (i.e., 
conservative substitutions) at corresponding positions. The 
term "substantial sequence identity" means that two peptide 
sequences, when optimally aligned, such as by the programs GAP 

15 or BESTFIT using default gap weights, share at least 80 

percent sequence identity, preferably at least 90 percent 
sequence identity, more preferably at least 95 percent 
sequence identity or more (e.g., 99 percent sequence 
identity) . The term "substantial similarity" means that two 

20 peptide sequences share corresponding percentages of sequence 
similarity. Preferably, residue positions which are not 
identical differ by conservative amino acid substitutions. 
Conservative amino acid substitutions refer to the 
interchangeability of residues having similar side chains. 

25 For example, a group of amino acids having aliphatic side 

chains is glycine, alanine, valine, leucine, and isoleucine; a 
group of amino acids having aliphatic- hydroxy 1 side chains is 
serine and threonine; a group of amino acids having amide- 
containing side chains is asparagine and glutamine; a group of 

30 amino acids having aromatic side chains is phenylalanine, 

tyrosine, and tryptophan; a group of amino acids having basic 
side chains is lysine, arginine, and histidine; and a group of 
amino acids having sulfur-containing side chains is cysteine 
and methionine. Preferred conservative amino acids 

35 substitution groups are: valine-leucine-isoleucine, 

phenylalanine -tyrosine, lysine -arginine, alanine-valine, and 
asparagine -glutamine . Abbreviations for the twenty naturally 
occurring amino acids used herein follow conventional usage 
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(Immunology - A Synthesis (2nd ed., E.S. Golub & D.R. Gren, 
eds., Sinauer Associates, Sunderland, MA, 1991), incorporated 
herein by reference). Stereoisomers (e.g., D-amino acids) of 
the twenty conventional amino acids, unnatural amino acids 
5 such as a, a-disubstituted amino acids, N-alkyl amino acids, 

lactic acid, and other unconventional amino acids may also be 
suitable components for polypeptides of the present invention. 
Examples of unconventional amino acids include: 4- 
hydroxyproline , y-carboxyglutamate , e -N, N, N- trimethyllysine , 

10 e-N-acetyllysine, 0-phosphoserine, N-acetylserine, N- 

f ormylmethionine , 3 -methylhist idine , 5 - hydroxy lysine , co-N- 
methylarginine, and other similar amino acids and imino acids 
(e.g., 4-hydroxyproline) . Moreover, amino acids may be 
modified by glycosylation, phosphorylation and the like, 

15 The infectious PIV produced from cDNA- expressed 

genome or antigenome can be any of the PIV or PIV- like 
strains, e.g., human, bovine, murine, etc. To engender a 
protective immune response, the PIV strain may be one which is 
endogenous to the subject being immunized, such as human PIV 

20 being used to immunize humans. The genome or antigenome can 
be modified, however, to express heterologous PIV genes or 
gene segments, or genes or gene segments from other 
heterologous sources, e.g., RSV or measles virus. Thus, 
infectious PIV intended for administration to humans may be 

25 human PIV that has been modified to contain genes or gene 
segments from a bovine or murine PIV type such as for the 
purpose of attenuation. BPIV3 possesses host range mutations 
that restrict its replication in rhesus monkeys and humans 
(Karron et al., supra, 1995a; van Wyke Coelingh et al . , 1988), 

30 each incorporated herein by reference in its entirety) . 

Gene(s), mutations and cis-acting regulatory sequences of 
BPIV3 that specify desired phenotypes, e.g., host range 
restriction, will be identified by their substitution for 
corresponding sequences in rPIV of the invention, and 

35 incorporated within further modified rPIV to develop yet 

additional useful vaccine agents. Similarly, mutations of JS 
cp45 which are known to impart non-ts host-range attenuating 
mutations for the rhesus monkey (Hall et al., supra . (1992)) 



wo 98/53078 



PCTAJS98/10551 



37 

will likewise be identified and incorporated into modified 
rPIV vaccine agents of the invention. Alternatively, a bovine 
PIV may be modified to contain genes that encode, e.g., 
proteins or immunogenic epitopes that elicit protection 
5 against human PIV infection. For example, human PIV 

glycoprotein genes can be substituted for counterpart bovine 
glycoprotein genes, such that the bovine PIV elicits a 
protective immune response in humans against human PIV 
strains. 

10 In exemplary embodiments, individual genes, gene 

segments, or single or multiple nucleotides of one PIV are 
substituted by counterpart sequence (s) from a heterologous PIV 
or other source. For example, heterologous gene segments, 
such as one encoding a cytoplasmic tail, transmembrane domain 

15 or ectodomain, an epi topic site or region, a binding site or 
region, an active site or region containing an active site, 
etc., of a selected protein from one PIV is substituted for a 
counterpart gene segment in another PIV to yield novel 
recombinants, for example recombinants expressing a chimeric 

20 protein having a cytoplasmic tail and/or transmembrane domain 
of one PIV fused to an ectodomain of another PIV. Preferred 
genome segments in this regard range from about 15-35 
nucleotides in the case of gene segments encoding small 
functional domains of proteins, e.g., epitopic sites, to about 

25 50, 75, 100, 200-500, or 500-1,500 or more nucleotides for 
gene segments encoding larger domains or protein regions . 

In one aspect of the invention, selected domains of 
HN and/or F proteins of one PIV strain are substituted into a 
heterologous PIV clone to produce a recombinant virus capable 

30 of stimulating a cross -protective immune response against both 
PIV strains in an immunized host. In other aspects, modified 
PIV clones are provided which comprise a chimera of a human 
PIV genomic or antigenomic sequence and at least one non-human 
PIV sequence, for example a polynucleotide containing 

35 sequences from both human and bovine PIV. The replacement of 
a human PIV coding sequence or non-coding sequence (e.g., a 
promoter, gene-end, gene-start, intergenic or other cis-acting 
element) with a counterpart bovine or murine PIV sequence 



wo 98/53078 



PCT/US98/105S1 



38 

yields recombinants having a variety of possible attenuating 
effects. For example, a host range effect will often arise 
from a heterologous PIV gene not functioning efficiently in a 
human cell, from incompatibility of the heterologous sequence 
5 or protein with a biologically interactive human PIV sequence 
or protein (e.g., a sequence or protein that ordinarily 
cooperates with the substituted sequence or protein for viral 
transcription, translation, assembly, etc.), among other 
useful attenuating effects. In yet another aspect of the 

10 invention, insertion of foreign genes or gene segments, and in 
some cases of noncoding nucleotide sequences, into the PIV 
genome results in a desired increase in genome length causing 
yet additional, desired phenotypic effects. Increased genome 
length is expected to result in attenuation of the resultant 

15 PIV clone, dependent in part upon the length of the insert. 
In addition, the expression of certain proteins from a gene 
inserted into recombinant PIV will result in attenuation of 
the virus due to the action of the protein. This has been 
described for IL-2 expressed in vaccinia virus (see, e.g., 

20 Flexner et al.. Nature 33: -259-62 (1987)) and also would be 
expected for gamma interferon. 

Deletions, insertions, substitutions and other 
mutations involving changes of whole viral genes or gene 
segments in recombinant PIV of the invention yield highly 

25 stable vaccine candidates, which are particularly important in 
the case of immunosuppressed individuals. Many of these 
mutations will result in attenuation of resultant vaccine 
strains, whereas others will specify different types of 
desired phenotypic changes. For example, certain viral genes 

30 are known which encode proteins that specifically interfere 

with host immunity (see, e.g., Kato et al., EMBO. J. 16:578-87 
(1997) , incorporated herein by reference) . Ablation of such 
genes in vaccine viruses is expected to reduce virulence and 
pathogenesis and/or improve immunogenicity . 

35 In preferred aspects of the invention, the modified 

PIV clones represent a chimera of two or more human PIV 
genomes, for example a clone containing polynucleotide 
sequences from HPIV3 joined to sequences from one or more 



# 
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heterologous human PIV, such as HPIVl and HPIV2. Thus, 
individual genes or gene segments of human PIV3 may be 
replaced or supplemented with counterpart genes or gene 
segments from HPIV 1 or HPIV2, or visa versa. In one example 
5 described hereinbelow, the invention provides a PIV clone, 
rPIV3-l, into which both the HN and F glycoprotein genes of 
HPIVl are substituted for their counterpart genes in an HPIV3 
background, yielding a chimeric virus having immunological 
characteristics representative of both parental strains. 

10 In additional aspects of the invention, chimeric PIV 

or PIV clones having other alterations of genes or gene 
segments, as described above, are further modified by 
introducing one or more attenuating mutations adopted from a 
biologically derived mutant PIV, e.g., HPIV3 JS cp45 to 

15 achieve an attenuated, or further attenuated, chimeric mutant 
derivative. For example, one or more human PIV coding or non- 
coding polynucleotides may be substituted with a counterpart 
sequence from a heterologous human PIV, bovine PIV or murine 
PIV as described above, and this alteration may be combined 

20 with one or more mutations specifying, e.g., a ts, ca or att 
phenotype adopted from a biologically derived attenuated PIV 
mutant, to yield an attenuated or further attenuated (i.e., 
compared to either the chimeric clone or biologically derived 
parent virus) vaccine virus. Alternatively, functional 

25 deletion of a non-essential gene or gene segment, such as the 
C, D or V ORF, may be combined in a recombinant PIV with one 
or more mutations specifying a ts, ca or att phenotype from 
biologically derived PIV mutants to yield an attenuated 
vaccine strain. These combinatorial modifications yield 

30 recombinant PIV having desired phenotypic characteristics, 

e.g., increased yield of virus, enhanced attenuation, and/or 
genetic resistance to reversion from an attenuated phenotype, 
due to the combined effects of the different selected 
mutations. 

35 In one combinatorial mutation design, a modified PIV 

is provided which comprises a chimera of a human PIV genomic 
or antigenomic sequence and at least one non-human PIV 
sequence, for example a polynucleotide containing sequences 
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from both human and bovine PIV, and which also incorporates 
one or more mutations adopted from biologically derived PIV, 
e.g., one or more naturally occurring ts, ca or att mutations. 
Alternatively, the modified PIV can be a chimera of two or 
5 more human PIV genomes, for example a polynucleotide 

containing sequences from HPIV3 joined to sequences from one 
or more heterologous human PIVs, such as HPIVl and HPIV2, 
which further incorporates one or more ts, ca att or other 
selected mutations from biologically derived PIV (e.g., a 

10 nucleotide substitution specifying a ts, ca or att phenotype 
adopted from a biologically derived mutant PIV strain such as 
JS cp45) . In more detailed aspects, individual genes or gene 
segments of human PIV3 are replaced or supplemented with 
counterpart genes or gene segments from HPIVl or HPIV2, or 

15 visa versa, in a clone that is attenuated or further 

attenuated by, e.g., a nucleotide change encoding an amino 
acid substitution conferring a ts mutation in the large 
polymerase L gene. For example, the invention provides PIV 
clones having the HN and/or F glycoprotein genes of HPIVl 

20 substituted for their counterpart genes in an HPIV3 

background, wherein the phenotype of the resultant chimeric 
clone is further modified by ts, ca or att mutation (s) encoded 
within one or more of the N, P, L, M, HN, F, C, D and V genes. 
Various combinations from a menu of possible mutations 

25 disclosed herein can be made to calibrate a vaccine virus to a 
selected level of attenuation, immunogenicity and stability, 
e.g., to achieve a satisfactorily attenuated and immunogenic, 
chimeric virus having immunological characteristics 
representative of multiple PIV strains. In one aspect, 

30 recombinant PIVs are provided wherein at least one attenuating 
mutation occurs in the PIV polymerase gene L (as exemplified 
by the recombinants r942, r992, rl558, r942/992, r942/l558, 
r992/l558, or r942/992/l558 described in the Examples below) 
incorporated in a chimeric PIV background. For example, 

35 useful chimeric PIV recombinants within this aspect of the 

invention will have one or more genes or gene segments of the 
HN and/or F glycoprotein genes from, e.g., HPIVl substituted 
for their counterpart gene(s) in a heterologous background. 
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e.g., in an HPIV3 clone, and will further incorporate one or 
more attenuating mutations, eg., nucleotide substitutions 
resulting in an amino acid change in the polymerase gene (such 
as change from Tyr to His at position 942, a change from Leu 
5 to Phe at position 992, and/or a change from Thr to lie at 
position 1558) from a biologically derived PIV mutant. One 
such chimeric, attenuated recombinant exemplified hereinbelow 
is rPIV3-l. cp45L, a derivative of rPIV3-l which incorporates 
all three L gene mutations specified above from JS cp45. 

10 Yet additional mutations which can be incorporated in a 

chimeric PIV background for developing vaccine strains will be 
selected from biologically derived mutations in other genes, 
or will be created de novo in a recombinant genome by standard 
site directed mutagenesis or other purely recombinant 

15 mutagenic methods. Target genes for adopting biologically 

derived mutations or creating novel mutations in a recombinant 
PIV in this context include the nucleocapsid protein N, 
phosphoprotein P, large polymerase subunit L, matrix protein 
M, hemagglutinin-neuraminidase protein HN, fusion protein F 

20 and the C, D and V ORF products. Also targeted are extragenic 
sequences, eg., sequences in the 3' leader or trailer regions 
of a PIV genome. Exemplary mutations identified and 
incorporated in non-chimeric, recombinant PIV, described above 
will thus be readily incorporated within a chimeric PIV 

25 background, eg., as exemplified by rPIV3-l. These exemplary 
mutations include one or more mutations in the N protein, 
including specific mutations at a position corresponding to 
residues Valjg or Ser^Q^ of JS cp45. In more detailed aspects, 
these mutations are represented as Valj^ to Ala or Serjgg to 

30 Ala. Also desired for incorporation in chimeric PIV 

recombinants are mutations in the C protein, eg., a mutation 
at a position corresponding to lle^^ of JS cp45, preferably 
represented by a substitution of Ile^g to Thr, as described 
above. Further exemplary mutations for incorporation in a 

35 chimeric PIV background include one or more mutations in the F 
protein, for example adopted from JS cp45 at a position 
corresponding to residues Ile^ao or Ala45o/ eg., substitutions 
Ile^jo to Val or Ala45o to Thr. Yet additional chimeric PIV 
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recombinants within the invention will adopt one or more amino 
acid substitutions in the HN protein, for example a mutation 
at a position corresponding to residue Valag^ of JS cp45, such 
as Val384 to Ala. Yet additional chimeric recombinants will 
5 incorporate one or more mutations in an extragenic sequence, 
eg . , a 3 ' leader sequence of the recombinant genome or 
antigenome. Exemplary mutations in this context include 
mutations in the 3' leader occurs at one or more positions 
corresponding to nucleotide 23, 24, 28, and/or 45 of JS cp45, 

10 for example a T to C change at nucleotide 23, a C to T change 
at nucleotide 24, a G to T change at nucleotide 28, or a T to 
A change at nucleotide 45. Yet additional extragenic 
mutations for incorporation within a chimeric PIV background 
include one or more mutations in.a N gene start sequence, as 

15 exemplified herein by a mutation in the N gene start sequence 
at a position corresponding to nucleotide 62 of JS cp45, such 
as a A to T change. These exemplary mutations evaluated and 
combined into recombinant PIV in the Examples below will be 
readily incorporated within a chimeric PIV recombinant using 

20 the methods and tools provided herein, and will specify, 

individually and/or in combination, desired phenotypic changes 
to yield yet additional attenuated chimeric vaccine strains 
within the invention - 

In additional combinatorial mutation designs, modified 

25 PIVs are provided which incorporate one or more of the 

foregoing ts, ca or att mutations adopted from biologically 
derived PIV or generated recombinantly in a PIV clone of the 
invention, in combination with another, distinct mutation 
disclosed herein (e.g., a deletion, addition, or rearrangement 

30 of a PIV N, P, L, M, HN, F, C, D or V gene or gene segment, or 
a gene or gene segment from another source such as RSV or 
measles virus) , Also in this case, various combinations from 
a menu of mutations disclosed herein can be made to calibrate 
the vaccine virus to a selected level of attenuation, 

35 immunogenicity and stability. Thus, recombinant PIVs are 

provided which exhibit at least one attenuating mutation from 
a biologically derived PIV mutant, e.g., a mutation in the PIV 
polymerase gene L as found in JS cp45, or a recombinantly 
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generated mutation, and which further incorporates one or more 
additional changes selected from, e.g., substitution or 
introduction of a heterologous gene or gene segment from a 
non-PiV source (e.g., an immunogenic RSV or measles gene or 
5 epitope, or a gene encoding a cytokine) , a change in the order 
of viral genes to alter expression levels, removal of gene 
overlap, substitution of a PIV genome promoter with its 
antigenome counterpart, shortening, lengthening or removal of 
intergenic regions, e.g., to increase capacity for inserting 

10 foreign sequences, mutations in cis-acting signals to reduce 
or increase RNA replication or transcription, insertion of 
unique restriction sites, or deletion of even entire, non- 
essential genes, among other changes. 

For vaccine use, virus produced according to the 

15 present invention can be used directly in vaccine 

formulations, or lyophilized, as desired, using lyophilization 
protocols well known to the artisan. Lyophilized virus will 
typically be maintained at about 4®C. When ready for use the 
lyophilized virus is reconstituted in a stabilizing solution, 

20 e.g., saline or comprising SPG, Mg** and HEPES, with or without 
adjuvant, as further described below. 

PIV vaccines of the invention contain as an active 
ingredient an immunogenically effective amount of PIV produced 
as described herein. The modified virus may be introduced 

25 into a host with a physiologically acceptable carrier and/or 
adjuvant. Useful carriers are well known in the art, and 
include, e.g., water, buffered water, 0.4% saline, 0.3% 
glycine, hyaluronic acid and the like. The resulting aqueous 
solutions may be packaged for use as is, or lyophilized, the 

30 lyophilized preparation being combined with a sterile solution 
prior to administration, as mentioned above. The compositions 
may contain pharmaceutically acceptable auxiliary substances 
as required to approximate physiological conditions, such as 
pH adjusting and buffering agents, tonicity adjusting agents, 

35 wetting agents and the like, for example, sodium acetate, 

sodium lactate, sodium chloride, potassium chloride, calcium 
chloride, sorbitan monolaurate, triethanolamine oleate, and 
the like. Acceptable adjuvants include incomplete Freund's 
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adjuvant, MPL™ (3-o-deacylated nionophosphoryl lipid A; RIBI 
ImmunoChem Research, Inc., Hamilton, MT) and IL-12 (Genetics 
Institute, Cambridge MA) , among many other suitable adjuvants 
well knovm in the art. 
5 Upon immunization with a PIV composition as 

described herein, via aerosol, droplet, oral, topical or other 
route, the immune system of the host responds to the vaccine 
by producing antibodies specific for PIV virus proteins, e.g., 
F and HN glycoproteins. As a result of the vaccination with 

10 an immunogenically effective amount of PIV produced as 

described herein, the host becomes at least partially or 
completely immune to PIV infection, or resistant to developing 
moderate or severe PIV infection, particularly of the lower 
respiratory tract. 

15 The host to which the vaccines are administered can 

be any mammal which is susceptible to infection by PIV or a 
closely related virus and which host is capable of generating 
a protective immune response to the antigens of the 
vaccinizing strain. Accordingly, the invention provides 

20 methods for creating vaccines for a variety of human and 
veterinary uses. 

The vaccine compositions containing the PIV of the 
invention are administered to a host susceptible to or 
otherwise at risk for PIV infection to enhance the host's own 

25 immune response capabilities. Such an amount is defined to be 
a "immunogenically effective dose." In this use, the precise 
amount of PIV to be administered within an effective dose will 
depend on the host's state of health and weight, the mode of 
administration, the nature of the formulation, etc., but will 

30 generally range from about 10^ to about 10^ plaque forming 
units (PFU) or more of virus per host, more commonly from 
about 10* to 10^ PFU virus per host. In any event, the vaccine 
formulations should provide a quantity of modified PIV of the 
invention sufficient to effectively protect the host patient 

35 against serious or life -threatening PIV infection. 

The PIV produced in accordance with the present 
invention can be combined with viruses of other PIV serotypes 
or strains to achieve protection against multiple PIV 
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serotypes or strains. Alternatively, protection against 
multiple PIV serotypes or strains can be achieved by combining 
protective epitopes of multiple serotypes or strains 
engineered into one virus, as described herein. Typically 
5 when different viruses are administered they will be in 

admixture and administered simultaneously, but they may also 
be administered separately. Immunization with one strain may 
protect against different strains of the same or different 
serotype . 

10 In some instances it may be desirable to combine the 

PIV vaccines of the invention with vaccines which induce 
protective responses to other agents, particularly other 
childhood viruses. In another aspect of the invention the PIV 
can be employed as a vector for protective antigens of other 

15 pathogens, such as respiratory syncytial virus (RSV) or 

measles virus, by incorporating the sequences encoding those 
protective antigens into the PIV genome or antigenome which is 
used to produce infectious PIV, as described herein. The 
cloning of RSV cDNA and other disclosure relevant to the 

20 invention is described in copending United States patent 

applications Serial Nos. 08/534,768, 60/021,773, 08/720,132, 
60/046,141, 60/047,634, and 08/892,403, and PCT patent 
application PCT/US97/12269, each incorporated herein by 
reference. 

25 Single or multiple administrations of the vaccine 

compositions of the invention can be carried out. In neonates 
and infants, multiple administration may be required to elicit 
sufficient levels of immunity. Administration should begin 
within the first month of life, and at intervals throughout 

30 childhood, such as at two months, six months, one year and two 
years, as necessary to maintain sufficient levels of 
protection against native (wild-type) PIV and/or other subject 
viral infections. Similarly, adults who are particularly 
susceptible to repeated or serious PIV infection, for example 

35 health care workers, day care workers, family members of young 
children, the elderly, and individuals with compromised 
cardiopulmonary function, may require multiple immunizations 
to establish and/or maintain protective immune responses. 
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Levels of induced immunity provided by the vaccines 
of the invention can be monitored by measuring amounts of 
neutralizing secretory and serum antibodies. Based on these 
measurements, vaccine dosages can be adjusted or vaccinations 
5 repeated as necessary to maintain desired levels of 

protection. Further, different vaccine viruses may be 
advantageous for different recipient groups. For example, an 
engineered PIV strain expressing an additional protein rich in 
T cell epitopes may be particularly advantageous for adults 

10 rather than for infants. 

In yet another aspect of the invention the PIV is 
employed as a vector for transient gene therapy of the 
respiratory tract. According to this embodiment the 
recombinant PIV genome or antigenome incorporates a sequence 

15 which is capable of encoding a gene product of interest: The 
gene product of interest is under control of tYie same or a 
different promoter from that which controls PIV expression. 
The infectious PIV produced by coexpressing the recombinant 
PIV genome or antigenome with the N, P, L and other desired 

20 PIV proteins, and containing a sequence encoding the gene 
product of interest, is administered to a patient . 
Administration is typically by aerosol, nebulizer, or other 
topical application to the respiratory tract of the patient 
being treated. Recombinant PIV is administered in an amount 

25 sufficient to result in the expression of therapeutic or 
prophylactic levels of the desired gene product. 
Representative gene products which may be administered within 
this method are preferably suitable for transient expression, 
including, for example, interleukin-2 , interleukin-4 , gamma- 

30 interferon, GM-CSF, G-CSF, erythropoietin, and other 

cytokines, glucocerebrosidase, phenylalanine hydroxylase, 
cystic fibrosis transmembrane conductance regulator (CFTR) , 
hypoxanthine- guanine phosphoribosyl transferase, cytotoxins, 
tumor suppressor genes, antisense RNAs, and vaccine antigens. 



The following examples are provided by way of 
illustration, not limitation. 
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EXAMPLE I 

Construction of Plasmid p218(131) 
Encoding Negative sense PIV Genomic RNA 



5 A full cDNA clone designated p218(131) (Fig. 1; SEQ 

ID NO: 1) (deposited under the terms of the Budapest Treaty 
with the American Type Culture Collection (ATCC) of 12301 
Parklawn Drive, Rockville, Maryland 20852, U.S.A., and granted 
the designation 97991) was constructed to encode the complete 

10 15462 nt genomic sequence of HPIV3 JS strain. A hepatitis 

delta ribozyme was placed abutting the 3' end of the genomic 
sequence such that self -cleavage would yield the 3' end of 
HPIV3 (Perrotta and Been, Nature 350 : 434-436, (1991), 
incorporated herein by reference in its entirety) . A T7 

15 transcription terminator was placed following the delta 

ribozyme. The T7 promoter was placed adjacent to the 5" end 
of the genomic sequence such that the 5" terminal nucleotide 
of the HPIV3 genome was the first nucleotide synthesized. In 
this configuration, the cDNA encodes a complete negative-sense 

20 copy of PIV3 genomic RNA containing the correct genomic 
termini without any additional heterologous nucleotides. 

The HPIV3 cDNA was assembled from 14 overlapping 
subclones (termed A*-L, which letters in parentheses designate 
individual plasmids and do not refer to specific viral genes) 

25 constructed by reverse transcription (RT) and polymerase chain 
reaction (PCR) of RNA isolated from virions purified by 
sucrose gradient centrif ugation (Stokes et al., supra , 1992; 
Stokes et al . , supra, 1993, each incorporated herein by 
reference in its entirety) . The subclones spanned the 

30 following nucleotides of genomic RNA (numbered with the 3 "end 
designated as position 1): 1-2058 (A*), 1874-3111 (A'), 
3086-5140 (C) , 4348-5276 (C) , 5072-6695 (D*) , 5904-8532 (E) , 
7806-9898 (F) , 9632-10740 (F') , 9760-10955 (G) , 10862-11925 
(H) , 11835-12868 (I), 12426-13677 (J), 13630-14496 (K) , and 

35 14467-15462 (L) . Each fragment was cloned into pBluescript 
KSII (Strategene, La Jolla, CA) using conventional cloning 
techniques and was sequenced completely. 
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Plasinid p(L) was then subjected to site-directed 



mutagenesis to introduce the T7 promoter via a single- stranded 
DNA intermediate according to the MUTA-GENE procedure (BioRad, 
Hercules, CA) . The T7 promoter was positioned so that 
5 transcription initiates at the precise 5" end of the HPIV3 
genome using the negative-sense mutagenic primer: 
5 " -AATACGACrCACrA TA*ACCAAACAAGAGAAG- 3 " (SEQ ID NO: 2; T7 
sequences are italicized, HPIV3 -specif ic sequences are 
underlined, and the 5 '-end HPIV3 nucleotide, genome position 

10 15462, is indicated by an asterisk) . This modified p(L) was 

designated p{L*) . Plasmid p(E) was modified to yield p(E*) by 
the same method using the negative-sense mutagenic 
oligonucleotide 5'-CCAAGTACTATGAGATGCTTGATT-3' (SEQ ID NO: 3) 
to insert three nucleotide substitutions (underlined) into the 

15 HN gene at HPIV3 position 7903, 7913, 7915 (Fig. 1). These 
substitutions removed an Hga I site, inserted a Sea 1 site, 
and modified amino acid 370 of the encoded HN protein such 
that the epitope recognized by monoclonal antibodies (mAb) 
423/6 and 170/7 was ablated (van Wyke Coelingh et al., 

20 Virol , 61:1473-1477, (1987), incorporated herein by 

reference) . The p(E*) to p(K) subclones were assembled in a 
step-wise fashion into the p(L*) plasmid to give 
p(E*FF'GHIJKL*) (Fig. lA) . This plasmid includes HPIV3 
nucleotides 5904-15462 with the T7 promoter adjacent to 

25 nucleotide 15462 at the 5' end of the genome. Subclones p(A*) 
to p(E) were assembled into a second, overlapping subclone, 
p(A*A"CC'D*E) which contained HPIV3 nucleotides 1-8532. 



were sequenced completely. In addition to the introduced 
30 point mutations described above, the cDNA differed from the 

authentic JS HPIV3 sequence (Stokes et al., supra . 1992) by a 
single nucleotide substitution at position 1615 which was 
within the N gene and caused a substitution at amino acid 506 
in the encoded protein. Three other nucleotide substitutions 
35 were found at positions 10355, 11333, and 15248 in the L gene 
which did not change the encoded protein (Fig. 2) . These 
three noncoding changes were retained as additional sequence 
markers to identify recombinant virus (designated rPIV) 



Both subclones p (E*FF"GHIJKL*) and p(A*A"CC'D*E) 
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derived from cDNA, and the mutation in the N gene was 
corrected as described later. 

Subclone p (A*A'CC'D*E) was then modified to insert 
the hepatitis delta virus ribozyme and T7 terminator adjacent 
5 to HPIV3 position 1. An HPIV3 minigenorae in which the 3' end 
of the HPIV3 genome ( GGT >^GGG) (underlined) was generated 
through self -cleavage of a flanking hepatitis delta virus 
antigenomic ribozyme (shown in part in bold -type) was 
previously constructed (Dimock and Collins, J. Virol. 67: 

10 2772-2778, (1993); Perrotta and Been, supra , (1991), each 
incorporated herein by reference in its entirety) . The 
ribozyme in turn was followed by a T7 transcription 
terminator. This minigenome cDNA was used as a template in a 
PGR reaction which modified the sequence adjacent to the 

15 ribozyme cleavage site to be a Sma I site (CCCiGGG) and placed 
an Apal site (GGGCC>fcC) on the downstream side of the T7 
terminator. The PGR product was cloned into pKSII which had 
been digested with BssHII and made blunt-ended by filling in, 
yielding p218. 

20 p218 was designed such that any sequence could be 

introduced into the opened Sma I site by blunt -end ligation 
and its RNA transcript would be cleaved at the delta ribozyme 
cut site (NNN^GGG) . The p (A*A''CC''D*E) subclone was digested 
with Hga 1 and Sal I (8533), which released the HPIV3 cDNA, 

25 and was filled in with dNTPs and T4 DNA polymerase to give 

blunt termini. The Hga 1 site is 10 nucleotides upstream of 
HPIV3 position 1 and, when digested and filled in, leaves a 
blunt terminus beginning with HPIV3 position 1. The modified 
Hga 1-Sal I fragment was gel purified and cloned into the Sma 

30 I site of p218. The mutation in the N gene (T at nt 1615) was 
corrected to the JS wt sequence (A at nt 1615) (see GenBank 
accession #Z11575, incorporated herein by reference) using 
Kunkel mutagenesis (Kunkel et al.. Methods Enzymol . 154: 367- 
382, (1987), incorporated herein by reference in its 

35 entirety). This plasmid was designated p218 (A*A'CC'D*E) (Fig. 
1) . 
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The Xho 1-Ngo MI fragment of p (E*FF"GHIJKL*) , which 
contained the T7 promoter and the HPIV3 cDNA from nucleotides 
7438-15462, was cloned into the Xho l-Ngo MI window of 
p218 (A*A'CC"D*E) (Fig. 1). This joined the two fragments of 
5 HPIV3 cDNA at HPIV3 nucleotide 7438, yielding a plasmid 

containing a full-length HPIV3 cDNA encoding a negative- sense 
genomic RNA with the above-mentioned three desired mutations 
in the HN gene and three incidental mutations in the L gene. 
The final construct, designated p218(131) (Fig. 1; SEQ ID NO: 

10 1) , was then sequenced in its entirety by automated sequencing 
at the NCI Frederick Cancer Research and Development Center 
(Frederick, MD) using the Taq DYE Deoxy Terminator cycle 
sequencing kit (ABI, Foster City, CA) . This identified an 
additional change in the HN gene, namely a change of C to T in 

15 the HN gene at position 7593 which changed HN amino acid 263 

from threonine to isoleucine that is also indicated in Fig. 2. 

EXAMPLE II 

20 Transcription and RNA Replication System for HPIV3 

The present example describes compositions and 
methods for producing a reconstituted transcription and RNA 
replication system for human parainfluenza virus type 3 

25 (HPIV3) . This exemplary system was developed using components 
expressed intracellularly from transfected plasmids driven by 
a T7 RNA polymerase supplied by a vaccinia virus recombinant. 
The system is based on a negative -sense analog of HPIV3 
genomic RNA in which the viral genes were deleted and replaced 

30 with a polynucleotide encoding bacterial chloramphenicol 
acetyl transferase (CAT) . The N, P and L proteins are 
expressed from cotransf ected plasmids so as to direct 
efficient transcription and RNA replication. Transcription 
according to this example yields subgenomic polyadenylated 

35 mRNA, which can be readily isolated, e.g., by oligo(dT) 

chromatography. RNA replication according to this example 
yields mini- ant igenome and progeny minigenome, which are shown 
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to be encapsidated based on resistance to digestion with 
micrococcal nuclease. 

A) Viruses and cells 

A vaccinia virus recombinant, vTF7-3, that expresses 
5 bacteriophage T7 RNA polymerase, was provided as described by 
Fuerst et al . ( Proc. Natl. Acad. Sci. U>S.A. 83 : 8122-8126, 
1986, incorporated herein by reference in its entirety). 
HEp-2 monolayer cultures were maintained at 37®C in 5% CO2 
with OptiMEM 1 (Life Technologies, Gaithersburg, MD) 
10 supplemented with 2% fetal bovine serum (FBS) , 50 /xg/ml 
gentamicin sulfate and 2mM glutamine. 

B) cDNAs 

cDNAs corresponding to ORFs of the N, P, and L genes 
of the JS strain of HPIV3 (GenBank #Z11575; Stokes et al . , 

15 1992) were individually cloned into the Nco 1-Sal I window of 
plasmid pTM-1, in which transcription is mediated by T7 RNA 
polymerase and translation by an internal ribosome entry site 
preceding the foreign ORF (Elroy-Stein et al . , Proc. Natl. 
Acad. Sci. U.S.A. 86: 6126-6130 (1989) , incorporated herein by 

20 reference in its entirety) . Each gene was first modified by 
polymerase chain reaction (PGR) to place an Nco I or Nco 
I-compatible site at the translational start site and a Sal I 
site on the downstream end. 

The plasmid p(131), which is similar to p218(131) 

25 except that it lacks the hepatitis delta virus ribozyme, was 
used as a template for each PGR. The primers used to amplify 
the N ORF were GCGTATAATTTGAACarGTTGAGGGTATTTG (SEQ ID NO: 4; 
forward primer relative to positive-sense) and 
GATTAAAATGTTGGTCGACTTAGTTGCTTCC (SEQ ID NO: 5; italics 

30 represent restriction enzyme sites, and the translational 

start site is in bold) . The PGR product, a 1578 bp fragment 
flanked by an Afl III and Sal 1 site, was cloned into the Nco 
I- Sal I window of pTM-1 to yield pTM(N) . 

The primers used to amplify the PIV3 phosphoprotein 

35 (P) ORF were 5- ' CGATAGAGAGTCCATGGAAAGCGATGGTAAAAACTATC-3 ' (SEQ 
ID NO: 6; forward primer) and 5'- 

CGGTGTCGTTTCTTTGrCGACrCATTGGCAATTGTTG-3' (SEQ ID NO: 7; reverse 
primer) . A full-length cDNA of JS strain of genomic RNA 
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(pl31) was used as template for the PGR. The resultant PGR 
product was an 1851 bp fragment flanked by an Nco 1 and Sal I 
restriction site (in italics) . The PGR product was then 
cloned into the Nco I -Sal I window of pTM-1 to yield pTM(P) . 

A second PGR was performed to amplify the PIV3 
phosphoprotein P ORF without the G ORF. pl31 was again used 
as template cDNA. A different forward primer and the same 
reverse primer were used to amplify the PIV3 P ORF without G; 
5' -GGATAGAGAGTCCaTGGA7^GGGACGGTAAAAAGTATG-3' (SEQ ID NO: 74; 
forward primer) and 5' -GGGTGTCGTTTGTTTG!^CGACTGATTGGGAATTGTTG- 
3' (SEQ ID NO: 7; reverse primer) . The resultant PGR product 
was an 1851 bp fragment flanked by an Nco 1 and Sal 1 
restriction site (designated by italics) . The underlined 
nucleotide in the forward primer represents a nucleotide 
substitution which is silent in the P ORF but changes the 
start codon of the G ORF to threonine. The next start codon 
for the G ORF is more than 4 00 nucleotides downstream. Thus, 
only the P protein would be produced. The PGR product was 
then cloned into the Nco I -Sal I window of pTM-1 to yield a 
second plasmid, pTM(P no G) . 

The L ORF of HPIV3 was cloned into pTM-1 in three 
parts: the ends were derived from PGR products and the main 
body was a restriction fragment from p218(131). The upstream 
end of the L ORF was amplified using the primers 
GGAAAGGGTGCCCGGGCCATGGAGAGTGAATGTAAGAATGGC (SEQ ID NO: 8) and 
GAAATTCCTTAATCGAITGTGTAGATTG (SEQ ID NO: 9) . This yielded the 
1,020-bp PGR product LI in which positions 8625-9645 of the 
full-length genome were flanked by Sma I and Nco I sites on 
the upstream end and a Cla I site on the downstream end (all 
three sites are italicized) . The downstream end of the L ORF 
was amplified using the primers GGGATGAAGTOTAAGATACGTAAGAAAGAC 
(SEQ ID NO: 10) and GGTTAGGATATGTCGACATTGTATTTATG (SEQ ID NO: 
11). This yielded the 1,733-bp PGR product L2 in which 
positions 13,645-15,378 of the full-length genome were flanked 
by a SnaB 1 and Sal I site (italicized) . Plasmid p(131) was 
digested with Cla 1 and Pst I to yield the 4,487-bp fragment L 
middle containing positions 9,630-14,120 of the full-length 
genome. LI and L middle were joined at the common Cla 1 site 
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and cloned into the Sma I-Pst I window of pBluescript to yield 
p(Ll+L middle) . The L2 fragment was then cloned into the Pst 
I -Sal I window of p(Ll-f-L middle) to yield the complete L ORF 
flanked by Nco I and Sal I . This was then cloned into the Nco 
5 I -Sal I window of pTM-1 to yield pTM(L) . The sequences of 

PCR-generated regions of pTM(N) (SEQ ID NO: 20), pTM(P) (SEQ 
ID NO: 21), and pTM(L) (SEQ ID NO: 22) were confirmed by the 
dideoxynucleotide sequencing method. 

To increase the efficiency of T7 transcription, 

10 certain modifications were made to a cDNA construct encoding a 
negative -sense PIV minigenome, called PIV3-CAT(-) (Dimock and 
Collins, J. Virol. 67 : 2772-2778 (1993), incorporated herein 
by reference in its entirety) . PIV3-CAT{-) includes the 
3 "-terminal 111 nucleotides and 5 "-terminal 115 nucleotides of 

15 the HPIV3 genome fused to a negative -sense copy of the CAT 

ORF. This cDNA was designed to yield, upon linearization with 
Hgal and transcription with T7 RNA polymerase, a minigenome 
containing the exact correct ends of the HPIV3 genome. Two 
successive rounds of PCR, using mutagenic oligonucleotides 

20 which added successive extensions to the cDNA end, were used 
to replace the Hgal site with the hepatitis delta ribozyme 
(Perotta and Been, Nature 350 : 434-436 (1991) , incorporated 
herein by reference in its entirety) , such that self -cleavage 
generates the correct 3" HPIV3 genomic end. A T7 

25 transcriptional termination signal was inserted immediately 
after the ribozyme (Fig. 3) to yield PIV3-CAT-delta . 

PIV3-CAT-delta cDNA was modified by PCR mutagenesis 
to insert one, two or three G residues between the T7 promoter 
and the 5' end of the minigenome, using restriction sites 

30 flanking the trailer and T7 promoter. This modification 
yielded increased efficiency of T7 transcription. In 
preliminary experiments, the minigenome containing two G 
residues, called PIV3-CAT-GG, was the most active in the 
expression of CAT in the reconstituted transcription and 

35 replication system described below, and was used for all 
subsequent derivatives. 

The PIV3-CAT-GG cDNA was further modified by 
overlapping PCR mutagenesis to introduce modifications 
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simultaneously at two sites, as follows. First, the sequence 
T7CT, which contains tandem transcriptional termination motifs 
for the vaccinia virus early-stage RNA polymerase (T5NT) (Yuen 
and Moss, Proc . Natl. Acad. Sci. U.S.A. 84 ; 6417-6421 (1987), 
5 incorporated herein by reference in its entirety) , was 

inserted into the positive-sense strand cDNA strand between 
the delta ribozyme and T7 transcriptional terminator (Fig. 1) . 
This motif, and a second motif described herein below, were 
added to prevent promiscuous transcription of the CAT gene by 

10 the vaccinia virus early RNA polymerase. Second, the 

minigenome cDNA insert was modified at the junction between 
the N nontranslated region and CAT gene to contain (i) the 
insertion of a second vaccinia virus termination motif (T^AT) 
into the positive-sense cDNA strand at positions 103 to 109 

15 relative to the encoded minigenome, and (ii) the insertion of 
0 to 6 G residues (negative-sense) at minigenome position 112 
(Fig. 3) . The overlapping PGR mutagenesis involved three sets 
of reactions (PGR 1, 2 and 3) performed as follows. PGR 1 was 
a set of seven parallel reactions [PCRl (0) to (+6)] which 

20 used the PIV3-CAT-GG cDNA as a template and the following two 
mutagenic oligonucleotides as primers: the forward primer was: 
■^^^GGGGTTATGCTACTGGAGGCTTTTTTTCTCCCTTAGCCATCCG'" (SEQ ID NO: 
12) and the reverse primer was: 

^2^GTGCArrCrAGA(N)TTATAAAAATTATAGAGTTCGG^° (SEQ ID NO: 13). The 
25 bold sequence in the first oligonucleotide is the upstream 
tandem vaccinia terminator, and the bold sequence in the 
second oligonucleotide is the second terminator. This 
reaction amplified the ribozyme and adjacent leader region and 
inserted the mutations described above. PGR 2 was a single 
30 reaction that used the PIV3-CAT-GG cDNA as a template, and a 

forward primer that hybridized in plasmid sequence upstream of 
a unique NgoMl site (Fig- 3) , and a reverse primer 
complementary to the forward primer of reaction one. Thus, 
the products of PGR 1 and 2 overlapped at this latter 
35 sequence. The products of PGR 1 (0) to (+6) and PGR 2 were 
gel purified. The products of PCRl (0) to (+6) were each 
mixed separately with an aliquot of PGR 2 product and 
amplified in a third reaction {PCR3 (0) to (+6)) which also 
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contained the forward primer of PGR 2 and the reverse primer 
of PGR 1. The products of PCR3 (0) to (+6) were digested with 
NgoMI, which cuts in plasmid- specif ic sequence, and Xbal, 
which cuts at the upstream end of the CAT gene (Fig. 3) , and 
5 cloned into the NgoMl-Xbal window of PIV3-CAT-GG. This 

resulted in a panel of cDNAs encoding minigenomes which were 
named according to the number of inserted G residues: PIV3-CAT 
0 to PIV3-CAT +6. The structures of all DNA regions derived 
from PGR were confirmed by dideoxynucleotide sequencing. 

10 G) Trans feet ion 

HEp-2 cells were grown to 90% confluence in 6 well 
plates. Each well of a six-well plate (1.5 X 10^ cells) was 
transfected with 0.4 ng pTM(P), 0.4 /zg pTM(N), 0.05 fig pTM(L), 
and 0.4 fig minigenome plasmid. The plasmids were added to 0.1 

15 ml of OptiMEM (Life Technologies) and mixed with 0.1 ml of 

OptiMEM containing 12 /xl of LipofectACE (Life Technologies) . 
After an incubation period of approximately 15 minutes at room 
temperature, 0.8 ml of OptiMEM 1 containing 2% calf serum and 
1.5 X lO*' pf u of VTF7-3 was added to each well. The plates 

20 were incubated at 37*^G for 12 hours after which the media was 
replaced with fresh OptiMEM 1 containing 2% fetal bovine 
serum. The cells were then incubated at 37°G for a total of 
48 hours and harvested for RNA analysis and CAT assay. Each 
minigenome was represented in triplicate (3 wells) which was 

25 scraped into the medium and pooled. 
D) GAT assay 

An aliquot representing 3.33% (1.5 X 10^ cells) of 
each pooled sample of harvested cells described above was 
removed for GAT assay. The aliquot was centrifuged at 1,000 

30 rpm for 5 minutes and the supernatant discarded. The cell 
suspension was washed with 1 ml of 40 mM Tris, pH 7.5, 1 mM 
EDTA, 150 mM NaCl and resuspended in 50 /il 0.25 M Tris, pH 
7.5. Lysate was prepared by three cycles of freezing and 
thawing and clarified by centrifuging at 8,000 rpm for 5 

35 minutes. 1 /xl of lysate was assayed for the ability to 
acetylate D-threo- [dichloroacetyl 1-^*C] chloramphenicol 
(Amersham) using a conventional assay (Gorman et al . , Mol . 
Cell. Biol. 2: 1044-1051 (1982), incorporated herein by 
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reference in its entirety) . Acetylation was visualized by 
thin- layer chromatography and (quantified by phosphoimager 
analysis (Molecular Dynamics, Sunnyvale, CA) . 
E) RNA analysis 
5 The remaining cell harvest of each pooled sample was 

divided into three equal parts for isolation of encapsidated 
RNA, total RNA, and mRNA. The three aliquots were centrifuged 
at 1,000 rpm for five minutes and the supernatants discarded. 
Two aliquots of cell suspension were resuspended in 50 /xl of 

10 RSB (lOmM NaCl, lOmM Tris, pH 7.5, 1.5mM MgCl^) containing 1% 
Triton X-lOO, 0.5% DOC. 50 /il of 10 mM Tris 7.5, 1 mM CaClj, 
and 20 /xg (Img/ml stock) of micrococcal nuclease was then 
added to one aliquot, and the other received the same mixture 
without micrococcal nuclease (Baker & Moyer, J. Virol. 62: 

15 834-838 (1988) , incorporated herein by reference in its 

entirety) . The purpose of the micrococcal nuclease was to 
destroy nonencapsidated RNA, and the conditions used had been 
optimized in preliminary experiments. The mixtures were 
incubated at 30^C for 30 min and the RNA was isolated with 

20 Trizol (Life Technologies) according to the procedure of the 
supplier. The third aliquot of cell suspension was processed 
for RNA purification with Trizol and the purified RNA was 
separated by oligo(dT) cellulose chromatography into 
polyadenylated and nonpolyadenylated fractions (Grosfeld et 

25 al., J, Virol. 69 ; 5677-5686 (1995), incorporated herein by 
reference in its entirety). RNA samples were run on 1.5% 
agarose gels containing 0.44 M formaldehyde, transferred to 
nitrocellulose (Chomczynski , Anal. Biochem. 201 : 134-139 
(1992) , incorporated herein by reference in its entirety) , 

30 hybridized with strand specific riboprobes, and quantified by 
pho spho image r ana ly s i s . 

EXAMPLE III 

Construction and Expression of Modified PIV3 Miniaenomes 

35 

In the present example, a panel of cDNAs was 
constructed to encode PIV3 minigenomes which differed in 
length by single nucleotide increments. Transcription and RNA 
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replication in this reconstituted system were the most 
efficient for the minigenome whose length was an even multiple 
of six. In this context, members of the Paramyxovirus and 
Morbillivirus genera typically abide by a "rule of six," i.e., 
5 genomes (or minigenomes) replicate efficiently only when their 
nucleotide length is a multiple of six (thought to be a 
requirement for precise spacing of nucleotide residues 
relative to encapsidating NP protein) . However, the present 
findings illustrate that minigenomes whose lengths were one 

10 nucleotide greater than or less than an even multiple of six 
were surprisingly active, especially in RNA replication. 

A panel of seven cDNAs was constructed to encode 
seven PIV3-CAT minigenomes, called PIV3-CAT 0 to +6, that 
differ in length by single-nucleotide increments (Fig. 3) . 

15 Each minigenome is a short negative- sense analog of HPIV3 
genomic RNA in which the viral genes had been deleted and 
replaced with a negative-sense copy of the CAT ORF. The CAT 
ORF is flanked by nontranslated segments of the N and L genes, 
the N GS and L GE transcription motifs, and the 3^ leader and 

20 5' trailer extragenic termini of genomic RNA. The 5" end of 
each PIV3-CAT minigenome is defined by the adjacent promoter 
for T7 RNA polymerase. This promoter was constructed to 
contribute an extension of two nonviral G residues to the 5" 
end of the encoded minigenome, as described above. The 

25 presence of additional G residues adjacent to the 

nontranscribed core of the T7 promoter improves its 
transcriptional efficiency, and preliminary work showed that 
presence of two G residues provided the highest levels of 
activity in the reconstituted system described below. These 

30 two G residues are not included in minigenome length 
calculations. The 3' minigenome end is created by 
self -cleavage by an abutting hepatitis delta virus ribozyme, 
which would generate the correct 3 ^-terminal nucleotide. The 
seven PIV3-CAT minigenomes differ by single-nucleotide 

35 increases in length due to the insertion of 0 to 6 G residues 
at the junction between the N nontranslated region and the CAT 
gene (Fig. 3) , and range in length from 898 to 904 
nucleotides. The PIV3-CAT +2 minigenome is the only one which 
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is an even multiple of six (it should be noted that each 
minigenome contained two additional nonviral G residues at the 
5' end contributed by the T7 promoter, but it is assumed that 
these would be lost during intracellular HPIV3 -mediated RNA 
5 replication) . 

Each PIV3-CAT cDNA was transfected into HEp-2 cells 
that had been infected with vTF7-3, a vaccinia virus 
recombinant that expresses T7 RNA polymerase. Plasmids 
encoding the N, P and L proteins under the control of the T7 

10 promoter were transfected in parallel. The P cDNA had been 
modified by site -directed mutagenesis to eliminate the 
translational start site of the C ORF, as described above. 
Cells were harvested at 48 h post-infection. An aliquot of the 
cell suspension was processed fpr CAT enzyme assay (Fig. 4) . 

15 The remaining cells were divided into three equal aliquots and 
processed for RNA analysis as described below. 

The minigenome cDNA was further modified to contain 
two tandem vaccinia virus early-gene transcription termination 
motifs {T7NT) in the positive-sense plasmid strand upstream of 

20 the PIV3-CAT insert, and a third one (T5AT) in the same strand 
immediately upstream of the CAT ORF (Fig. 3) . These were 
designed to minimize promiscuous transcription of the CAT ORF 
by vaccinia virus polymerase (Grosfeld et al. (1995), supra ) . 
CAT expression was reproducibly detected when each of PIV3-CAT 

25 minigenomes was complemented by the N, P and L plasmids (Fig. 
4) , and detection of CAT was dependent on all three PIV3 
proteins. However, expression was much higher for PIV3-CAT+2, 
which has a nucleotide length that is an even multiple of six. 
Preferred ratios and amounts of the minigenome and support 

30 plasmids were determined based on CAT enzyme expression. 

EXAMPLE IV 

Synthesis of Positive-Sense RNAs bv PIV Minigenomes 

35 Transcription and replication of PIV minigenomes was 

confirmed by detection of RNA products of both processes. As 
described in the foregoing Example, three equal aliquots of 
cell suspension were taken for RNA analysis. One aliquot was 
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used for oligo(dT) analysis, as described below. The other 
two aliquots were lysed with detergent and incubated with 
micrococcal nuclease or mock- treated. RNA was then isolated, 
separated by electrophoresis on formaldehyde agarose gels, 
5 transferred to nitrocellulose, and analyzed by hybridization 
with negative -sense CAT riboprobe. RNA from 

micrococcal -treated and mock- treated lysates are shown in Fig. 
5A upper and lower panels, respectively. 

Analysis of RNA from mock- treated lysates showed 

10 that complementation of each minigenome with the N, P and L 

plasmids resulted in the synthesis of a band of RNA which was 
very similar in size to a marker consisting of RNA expressed 
by the 931-nucleotide RSV-CAT C2 minigenome (Grosfeld et al. 
(1995), supra ) . Phosphorimagery analysis is shown in Fig. 5B. 

15 Little or no RNA was detected when the N or L plasmids were 
omitted, confirming that these RNAs are products of the 
reconstituted PIV3 polymerase. 

Each PIV3-CAT minigenome is expected to encode two 
positive-sense RNAs, namely the mini- ant igenome and the 

20 subgenomic CAT mRNA. Each mini- ant igenome is expected to be 
the exact complement of its minigenome, which was 898 to 904 
nucleotides in length. The predicted subgenomic mRNA is 
defined by the GS and GE signals, and is expected to be 804 
nucleotides in length and contain a polyA tail of 100 to 200 

25 nucleotides. 

Detection of a single gel band of positive-sense RNA 
in Fig. 5A (lower panel) suggested that the ant igenome and 
mRNA were not resolved by gel electrophoresis. Accordingly, 
treatment with micrococcal nuclease was used to identify 

30 ant igenome RNA, since the ant igenome (and genome) but not mRNA 
would be encapsidated and resistant to digestion. The use of 
micrococcal nuclease for this purpose is well established 
(Baker & Moyer (1988) , supra ) , and the conditions selected 
were verified with RSV minireplicons and shown to completely 

35 degrade mRNA contained in the HEp-2 cell lysates. Residual 
RNA was purified and analyzed by Northern blot analysis with 
negative-sense riboprobe (Fig. 5A, upper panel) and 
quantitated by phosphorimagery (Fig. 5B,- note that in this 
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analysis the micrococcal- treated and untreated RNA amounts 
were normalized separately) . These investigations revealed 
the presence of a population of protected RNA corresponding to 
the positive-sense encapsidated mini - ant igenome . Among 



RNA, accumulation was greatest in the case of the +2 
minigenome, which is 900 nucleotides in length and thus a 

10 multiple of six. However, substantial amounts of RNA also 
accumulated in the case of the minigenomes which did not 
exhibit a length corresponding to a multiple of six 
nucleotides, in particular minigenomes +1 and +3 which were 
one nucleotide longer or shorter than the +2 minigenome. In 

15 fact, the amount of encapsidated antigenome produced by the +1 
and +3 mingenomes was 85% and 72% that of the +2 minigenome 
(Fig. SB) . Even the least efficient minigenome, the +5 
minigenome, was 20% as active as the +2 minigenome as 
determined by measurement of accumulated encapsidated RNA. In 

20 the case of measurements to detect total positive -sense RNA, 
the +1 and +3 minigenomes produced 52% and 45% as much total 
RNA as the +2 minigenome. 



final aliquot of harvested cell suspension was processed for 
25 RNA purification. The RNA was then subjected to oligo(dT) 

chromatography. RNAs which failed to bind, and those which 
bound and were eluted in low salt buffer, were analyzed by 
Northern blot hybridization (Fig. 6A) and phosphorimagery 
(Fig. 6B; note that in this case the bound and unbound are 
30 normalized together relative to the bound RNA of the +2 

minigenome) . These assays showed that approximately 64% of 
positive-sense RNA was polyadenylated, as expected for 
subgenomic mRNA. The accumulation of mRNA was greatest for 
the +2 minigenome. However, substantial amounts of mRNA also 
35 were observed for the other minigenomes. The amount of mRNA 
synthesized by the +1 and +3 minigenomes was 30% and 20% 
respectively compared to that synthesized by the +2 



several experiments, this protected RNA accounted for 
approximately 3 to 15% of the positive-sense RNA. 

For both the total and the micrococcal -resistant 



To confirm the presence of subgenomic mRNA, the 
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minigenome, and was approximately 13% for the least active 
minigenomes . 

EXAMPLE V 

5 Synthesis of Negative Sense RNA by PIV Miniaenomes 

The various PIV3-CAT minigenomes described in the 
foregoing examples directed synthesis of mRNA and 
positive-sense encapsidated mini- ant igenome, the latter 

10 representing the first step in RNA replication- The second 
step in RNA replication involves synthesis of encapsidated 
progeny minigenome from the mini -ant igenome product. To 
evaluate this latter process, the samples of RNA from 
mock-treated and nuclease-treated lysates described in the 

15 preceding Example were analyzed by Northern blot hybridization 
with positive-sense CAT riboprobe (Fig. 7A) and quantitated by 
phosphorimagery (Fig. 7B) . 

Analysis of RNA from mock- treated lysates (Fig. 7A, 
lower panel) showed that considerable amounts of minigenome 

20 accumulated intracellular ly in all samples, including negative 
controls in which the N or L support plasmid was omitted. The 
analyses described in Figs. 5A-B and 6A-B showed that the 
synthesis of positive -sense RNA was insignificant under these 
conditions. Therefore, the minigenome observed in the absence 

25 of N or L could not be the product of RNA replication mediated 
by the reconstituted HPIV3 polymerase, and instead must be the 
product of T7 transcription of transfected plasmid. 

Minigenome produced by the reconstituted HPIV3 
polymerase is expected to be encapsidated, whereas much of the 

30 minigenome produced by T7 RNA polymerase is expected to be 
unencapsidated. Therefore, RNA from the same micrococcal 
nuclease-treated samples described for Figs. 5A-B were used to 
prepare a second blot, which was hybridized with 
positive-sense CAT riboprobe (Fig. 7A, upper panel) . This 

35 showed that all minigenome RNA accumulated in the absence of 
the N protein was degraded (Fig. 7A, upper panel, lane 1), as 
expected. Essentially all of the minigenome which accumulated 
in the absence of L was also sensitive to degradation (Fig. 



wo 98/53078 



PCT/US98/10551 



62 

7A, upper panel, lane 2) . Plasmid-derived minigenome 
synthesized in the absence of L, and in the presence of N and 
P alone, did not appear to occur efficiently. 

when the complete set of three support plasmids was 
5 present, significant amounts of micrococcal nuclease-resistant 
minigenome RNA accumulated for each of the minigenomes (Fig. 
7A, upper panel) . As was the case with the positive-sense 
RNAs, the greatest amount of progeny minigenome was observed 
with the +2 minigenome. The +1 and +3 minigenomes were next in 

10 abundance, with levels of genomic RNA that were 67% and 42% of 
that of the +2 minigenome. 

The foregoing examples demonstrate that the HPIV3 N, 
P and L proteins were necessary and sufficient for efficient 
transcription and RNA replication. The very robust nature of 

15 transcription and RNA replication mediated by the 

reconstituted PIV3 polymerase confirmed the functionality of 
the encoded proteins. It is further expected that inclusion 
of additional viral proteins within the expression system will 
augment or modify these processes. Coexpression of PIV C, D 

20 and potentially V, within the compositions and methods of the 
invention will be useful, e.g., to augment and/or modify RNA 
replication. For this purpose, plasmids will be constructed 
and assayed according to the foregoing methods to achieve 
coexpression of one or more of these elements to determine 

25 their effects on PIV transcription and RNA replication, as 
well as on PIV phenotype in suitable infection models. 

EXAMPLE VI 

30 Construction of Infectious. Recombinant PIV from cDNA 

The following examples describe production of 
infectious, recombinant PIV (rPIV) by intracellular 
co-expression of four plasmid-borne cDNAs. These cDNAs 
35 separately encode a complete HPIV3 genome and the HPIV3 
nucleocapsid protein N, the phosphoprotein P, and the 
polymerase protein L. 
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A) Viruses and cells 

Modified vaccinia strain Ankara (MVA) , the vaccinia 
virus recombinant that expresses bacteriophage T7 RNA 
polymerase, was provided according to Wyatt et al., Virol . 
5 210:202-205, 1995, incorporated herein by reference in its 

entirety. HEp-2 monolayer cultures were maintained at 37* C 
in 5% CO2 with OptiMEM 1 (Life Technologies) supplemented with 
2% FBS, 50/ig/ml gentamicin sulfate and 2mM glutamine. The JS 
wt strain of HPIV3 and its attenuated ts derivative, JS cp45, 
10 were propagated in LLC-MK2 cells as described by Hall et al., 
Virus Res. 22 ; 173-184, (1992), incorporated herein by 
reference in its entirety. 

B) cDNAs 

The full cDNA clone designated p218(131) (Fig. 1, 

15 SEQ ID NO: 1) encoding a complete genomic sequence of HPIV3 
was constructed as described above. A second cDNA clone 
designated p3/7(131) (SEQ ID NO: 14) (deposited under the 
terms of the Budapest Treaty with the ATCC, and granted the 
designation 97990) was constructed to encode a complete 

20 antigenomic sequence of HPIV3. p3/7(131) differs from 

p218(131) in that the positions of the T7 promoter and the 
ribozyme/T7 terminator relative to the HPIV3 cDNA insert have 
been interchanged. Thus, the first nucleotide synthesized in 
p3/7(131) is the end of the antigenome, namely the 

25 positive-sense complement of genome position 1, and the 3^ 

antigenome end defined by ribozyme cleavage is the complement 
of genome position 15462. A third clone designated 
p3/7{131)2G (SEQ ID NO: 15) (deposited under the terms of the 
Budapest Treaty with the ATCC, and granted the designation 

30 97989) was also constructed identical to p3/7(131), except 
that two G residues were inserted between the 5' end of the 
antigenome and the T7 promoter. 

For construction of p/37(131)2G and p3/7(131), the 
two plasmids p (A*A'CC''D*E) and p (E*FF'GHIJKL*) were modified 

35 and joined to encode the complete positive-sense antigenome of 
HPIV3 . First, the T7 terminator and delta ribozyme abutting 
the 3' end of HPIV3 in p (A*A"CC"D*E) were replaced by a T7 
promoter using PCR (see Fig. 8) . The positive-sense primer 



wo 98/53078 



PCT/US98/10551 



64 

5 " - GGCCCGTCGACGCGTAATACGACTCACTATAGG ACC TUVACAAGAG - 3 ' ( SEQ ID 
NO: 18) placed the T7 promoter (bold) adjacent to the 3" 
leader of HPIV3 (HPIV3 sequence underlined) with 2 G residues 
inserted between these two elements to improve transcription. 
5 A unique Mlu I site (italicized) was placed upstream of the 

T7 promoter for cloning purposes (Fig. 8) . The negative -sense 
primer 5^- ^^^^CGGCATCACGrGCTAC^^**^- 3" (SEQ ID NO: 19) spanned 
nt 1209-1224 of the HPIV3 N gene and included a unique Pml I 
site (italicized) present in the natural HPIV3 sequence. Both 

10 the PGR product and the parent template p218 (A*A"CC'D*E) were 
digested with Mlu I and Pml I and the PGR product was then 
cloned into the Aflu I-Pjt?I I window. A second PGR reaction was 
done using the same negative -sense primer and a positive-sense 
primer without 2G residues inserted between the T7 promoter 

15 and 3" end of HPIV3 . These plasmids were designated p(Left + 
2G) and p{Left +) . The construction of p(Left + 2G) is 
illustrated in Fig. 8, the construction of p(Left +) followed 
the same strategy. 

Plasmid p(E*FF"GHIJKL*) was modified by PGR to place 

20 the delta ribozyme and T7 terminator adjacent to the 5^ end of 
HPIV3 (Fig. 9) . The positive-sense primer: 
5 " -GGATTTGCGCGC^^^^^ AATTTAAATCATCTGG '^^^^- 3 " (SEQ ID NO: 16) 
introduced a BssH II site (italicized) just upstream of a 
unique Swa I site (italicized, underlined) in the L gene of 

25 HPIV3 {HPIV3 specific sequence is underlined) . The 
negative- sense primer: 

5 " -CCCAGGTCGGACCGCGAGGAGGTGGAGATGCCATGCCAGCCC^^^^^ACCA 
AAACAAGAGAAGAACTCTGTTTGG ^^^^^- 3' (SEQ ID NO: 17) placed a 
portion of the delta ribozyme (bold) which included a 

30 naturally- occurring unique Rsr 11 site (italicized) adjacent 

to the 5' end of the HPIV3 genome (negative -sense underlined) . 
This PCR product was digested with BssH II and Rsr 11 and 
cloned into the BssH 11-Rsr 11 window of plasmid p3/7 (Fig. 
9) . Plasmid p3/7 is identical to p218 except that the delta 

35 ribozyme and T7 terminator are in the reverse orientation. 

The p3/7 plasmid containing the above-mentioned insertion was 
designated pPIV3-3/7 and contained the complete delta ribozyme 
and T7 terminator adjacent to 5" terminal 651 nucleotides of 
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HPIV3. The Swal and NgoMI fragment of plasmid pPIV3-3/7 was 
then isolated and cloned into the Swal-NgoM I window of 
p(E*FF'GHIJKL*) . The resulting plasmid, designated p(Right +) 
placed the complete delta ribozyme and T7 terminator adjacent 
to the 5' end of HPIV3 (see Fig. 9) . The Xho I -NgoM I 
fragment of p (Right +) was cloned into the Xho I -NgoM 1 
window of p(Left +) and p(Left + 2G) resulting in plasmids 
p3/7(131) (SEQ ID NO: 14) and p3/7(131 2G) (SEQ ID NO: 15), 
respectively (Fig. 10) . These each encode the complete 
positive-sense analog of HPIV3 antigenomic RNA, with the 
latter cDNA containing two G residues adjacent to the T7 
promoter for improved transcriptional efficiency. 
C) Transfection 

HEp-2 cells were grown to 90% confluence in six well 
plates. Each well of a six-well plate (1.5 X 10^ cells) was 
transfected with the three previously-described support 
plasmids, 0.4/xg pTM(P), 0.4/xg pTM(N), 0.05/ig pTM(L), together 
with 5/xg of full-length genomic or antigenomic HPIV3 cDNA. 
The plasmids were added to 0.2 ml of OptiMEM 1 (Life 
Technologies) containing 12/il of LipofectACE (Life 
Technologies) . After an incubation period of approximately 15 
minutes at room temperature, 0.8 ml of OptiMEM 1 containing 2% 
fetal bovine serum and 1.5 X 10' pfu of MVA-T7 was added to 
each well. The cultures were incubated at 32 'C for 12 hours, 
after which the media was replaced with fresh OptiMEM 1 
containing 2% fetal bovine serum. The cultures were incubated 
at 32 'C for three additional days then harvested and passaged 
(referred to as passage 1) onto fresh HEp-2 monolayers. These 
passage 1 cultures were incubated at 32 *C for five days, and 
virus present in the culture was harvested, passaged once in 
LLC-MK2 cultures, and characterized by 

hemagglutination -inhibit ion (HAI) as described (van Wyke 
Coelingh et al., Virol . 143: 569-582, (1985) (incorporated 
herein by reference in its entirety) to determine if it 
possessed the monoclonal antibody resistant mutation (MARM) 
that marked virus recovered from cDNA. 
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D) Sequencing of Recombinant Virus 
The presence of nucleotide sequence markers in the 
HN and L genes of recombinant PIV was determined by RT-PCR of 
RNA isolated from recovered virions. 1 ml of rPIV (1 X 
5 lO^pfu/ml, passage level 2) was precipitated with 200^x1 25% 
polyethylene glycol by incubation on ice for one hour and 
centrifuging at 12,000g for 15 minutes. The RNA was purified 
with TRIzol reagent (Life Technologies) following the 
manufacturer's recommended procedure. RT-PCR was performed 

10 with the Superscript kit (Life Technologies) following the 

manufacturer's recommended protocol. Control reactions were 
identical except that reverse transcriptase was omitted from 
the reaction to confirm that the PGR products were derived 
solely from virus RNA and not from possible contaminating cDNA 

15 plasmids. Four primer pairs were used to generate PGR 

products from nt 7334-8715, 9364-10854,. 10939-15392, and 
13623-15392. The resultant PGR products were then sequenced 
using cycle dideoxynucleotide sequence analysis (New England 
Biolabs, Beverly, MA) . 

20 

EXAMPLE VII 

Recovery of Recombinant Virus From cDNA Encoding 
Negative -Sense Genomic RNA 

25 Plasmid p218(131) and the three support plasmids 

pTM(N), pTM(P), and pTM(L) were transfected into HEp-2 cells 
with MVA expressing T7 RNA polymerase. A control group 
consisting of pTM(N), pTM(P), pTM(L), and MVA was 
cotransfected into HEp-2 cells without p218(131). On day 

30 four, the transfection was harvested, passaged onto fresh 

HEp-2 cell monolayers for five days, and passaged again for 5 
days in LLG-MK2 cultures (passage 2) . Virus present in the 
passage 2 harvest was further characterized by HAI. Cultures 
from the transfection group which received the three support 

35 plasmids without the full-length genomic clone p218(131) did 

not yield HPIV3 . The rPIV recovered virus was confirmed to be 
HPIV3 since it reacted in the HAI assay with the mAbs 77/5, 
101/1, and 454/11 which are specific for HPIV3 (Table 1). It 
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was presumptively identified as being cDNA-derived because it 
failed to react with mAbs 170/7 and 423/6, consistent with the 
MARM mutation which had been introduced into the cDNA. 
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To confirm that rPIV was indeed recovered from cDNA, 
it was analyzed in parallel with wild- type JS strain HPIV3 by 
RT-PCR using four primer pairs flanking the seven inserted 
marker mutations. Each PGR product obtained was dependent 
5 upon the inclusion of RT, indicating that each was derived 

from RNA and not from contaminating cDNA. Cycle -sequencing of 
the four PGR products confirmed that the sequence of the rPIV 
contained each of the seven markers, sequencing data showing 
three of the markers is illustrated in Figure 11 . The 

10 sequence differences between rPIV and JS wt in the HN gene, 
including nt 7903, 7913, and 7915 are readily apparent. 
Similar sequence analyses were carried out for the other four 
markers at nt positions 7593, 10355, 11333, and 15248, and the 
rPIV possessed each mutation. 

15 These results demonstrate successful recovery of 

infectious rPIV from cDNA encoding a negative-sense genomic 
RNA. This differs from most published reports for recovery of 
nonsegmented negative strand RNA viruses, in which the cDNA 
used for virus recovery had been designed to encode 

20 positive-sense antigenomic RNA (Baron and Barrett, supra . 

1997; Gollins et al., supra . 1995; Conzelmann, supra . 1996; 
Garcin et al., supra , 1995; Lawson et al., supra , 1995; 
Radecke et al., supra , 1995; Whelan et al., supra . 1995). The 
recovery of infectious virus from a cDNA encoding genomic RNA 

25 had previously been reported only in the case of Sendai virus, 
and the efficiency of recovery was much lower than for cDNA 
encoding antigenomic RNA (Kato et al., 1996). In most other 
studies, the recovery of virus was achieved with antigenomic 
cDNA but not with genomic cDNA (Lawson et al., 1995; Whelan 

30 et al., 1995). A number of potential problems have been 

noted which may explain these refractory results, including 
possible annealment of cDNA-encoded genomic RNA with mRNA 
produced by the support plasmids; resulting in inactive 
hybrids (Conzelmann, supra . 1996; Lawson et al., supra . 1995). 

35 It has also been noted that T7 RNA polymerase appears to 

terminate preferentially at the gene junctions of genomic RNA, 
perhaps because the oligo U tract of the GE signal resembles 
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the natural signal for transcription termination by T7 RNA 
polymerase (Whelan et al., supra . 1995). 



5 EXAMPLE VII 

Recoveirv of Recombinant Virus From cDNA Encoding 
Positive -Sense Antiaenomic RNA 



As described in more detail above, p3/7(131) and 

10 p3/7(131)2G were constructed to encode a positive -sense, 
antigenome that give rise to recombinant PIV, Plasmid 
p3/7(131)2G is identical to p3/7(131) but for the addition of 
two G residues between the T7 promoter and the first 
nucleotide of the antigenome. The addition of two G residues 

15 between the T7 promoter and the first HPIV3 nucleotide 

p3/7{131)2G is based on the preceding examples demonstrating 
that the presence of the two added G residues (as opposed to 
O, 1 or 3 added G residues) yielded substantially increased 
levels of minireplicon replication. 

20 The two antigenome cDNAs [p3/7(131) and p3/7(131)2G] 

were transfected separately into cells together with the N, P 
and L support plasmids, and were infected simultaneously with 
the MVA-T7 recombinant virus using the same procedure 
described above for p218(131) . Infectious virus from each 

25 antigenomic cDNA was recovered and was presumptively 

identified as being cDNA-derived by its inability to react 
with mAbs 423/6 and 170/7. 

The efficiency of virus recovery was evaluated for 
each of the antigenome cDNAs versus the genome cDNA p218(131) . 

30 Twelve transfection reactions using the negative -sense genome 
cDNA p218(131) (SEQ ID NO: 1) were conducted in parallel with 
twelve transf ections using the positive-sense antigenome cDNA 
p3/7(131)2G (SEQ ID NO: 15) to compare efficiency of virus 
recovery from the two cDNAs. One ml of the transfection 

35 harvest from each well was titered on LLC-MK2 cells and the 
remaining 2mls were passaged (passage 1) onto fresh LLC-MK2 
cells. At day five, passage 1 was harvested and titered as 
described above. Recombinant virus was recovered from 12/12 
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wells transfected with p3/7{131)2G but from only 4/12 wells 
transfected with p218(131). The mean titer of virus present 

in culture of the passaged virus from the positive-sense 
antigenome was 10^°, nearly ten-fold higher than that from the 
negative-sense genome, which was 10*'\ However, with one 
additional amplification the titers became equivalent. 
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The efficiency of recovery of recombinant virus from 
the three full-length plasmids encoding the genomic or 
antigenomic HPIV3 RNAs was next studied, (i) to determine 
whether genomic or antigenomic cDNA is more efficient at 
generating recombinant virus and (ii) to determine the effect 
of two extra 5' terminal G residues on the yield of 
recombinant viruses (Table 2.) Unfortunately, it was not 
possible to directly titer the transfection harvest by plaque 
titration because residual MVA-T7 interfered with plaque 
formation of rJS on LLC-MK2 monolayer cultures. Therefore, we 
first compared the efficiency of recovery of rJS from multiple 
independent transf ections and, as seen in other experiments, 
found that rJS was more frequently recovered from cultures 
transf ected with p3/7(131)2G than from cultures transfected 
with p218{131) . Since each transfection harvest, including 
those with p218(131) yielded rJS in other experiments, it was 
not possible to use this method of analysis to compare the 
relative efficiency of generating recombinant virus. We 
therefore estimated the relative quantity of virus present in 
the transfection harvest by quantitating the amount of virus 
following one passage of the transfection harvest (Table 2) . 
Since each plasmid was designed to yield an identical virus 
(rJS) , differences in titers of the passaged virus were 
considered a reflection of differences in the titer of virus 
present in transfection harvest. In the above experiments, 
the construct with two 5' terminal G residues was the most 
efficient in generating rJS. This indicated that the two 
added residues indeed increased efficiency of recovery, but 
the mechanism for this small increment in efficiency was not 
defined. These findings indicate a small, but measurable, 
advantage of using p3/7(131)2G as substrate for future 
experiments designed to introduce mutations into the HPIV3 
cDNA. The small advantage in recovery of recombinant virus 
from p3/7{131)2G might be especially needed to recover viruses 
that have attenuating mutations that restrict replication in 
vitro. Although VSV has been recovered from antigenomic cDNA 
which included three G residues at the 5' -terminal end, the 
efficiency of recovery of virus from this construct was not 
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compared with recovery from the same construct lacking the 
extra residues (Whelan et al . , Proc . Natl . Acad . Sci . USA 
92:8388-8392 (1995), incorporated herein by reference). In 
contrast to the experience with Sendai virus and measles 
5 virus, it is clear that extra-viral G residues at the 5'- 

terminal end of the antigenome are not deleterious to recovery 
of recombinant virus, and, in fact, appear to be advantageous 
[Garcin et al., supra . 1995; Radecke et al., supra . 1995; Kato 
et al., 1996). The genomic and antigenomic plasmids without 

10 two 5' -terminal guanine residues appeared equally efficient in 
generating recombinant virus. This is in contrast to previous 
reports which have found genomic cDNA to be less efficient 
than antigenomic cDNA in generating recombinant virus (Whelan, 
1995; Kato et al . , 1996). 

15 The efficiency of recovery of virus from p3/7(131) 

(SEQ ID NO: 14) was compared to that of p3/7(131)2G (SEQ ID 
NO: 15 and p218(131) (SEQ ID NO: 1) in a similar experiment as 
above. Recovery from p3/7(131) was comparable to that of 
p217(131). Recovery was more consistent and efficient with 

20 both antigenome cDNAs containing the extra Gs (Table 2) . 

EXAMPLE VII 

Efficiency of plaque formation of rPIV at permissive and 
restrictive temperatures 

25 

The level of temperature sensitivity of replication 
in vitro of control viruses and rPIV derived from both 
p218(131) and p3/7(131)2G was determined at 32*C, 37*C, 39*C, 
and 40 *C in LLC-MK2 monolayer cultures as previously described 

30 (Hall et al . , Virus Res . 22 : 173-184, (1992), incorporated 
herein by reference in its entirety) , with the following 
modifications. Viruses were inoculated onto LLC-MK2 
monolayers on 24 -well plates in serial 10 -fold dilutions 
allowed to adsorb for one hour at room temperature. The 

35 cultures were then overlaid with 1ml of L-15 supplemented with 
2mM glutamine and 0.8% methylcellulose and the plates were 
then incubated for 5 days at the indicated temperature. The 
methylcellulose overlay was removed and the monolayer fixed 
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with 80% methanol at 4'C for Ih. The viral plaques present in 
the monolayer were scored using a mixture of two 
HPIV3 -specif ic anti-HN murine mAbs 101/1 and 66/4 as ascites 
fluid used at a dilution of 1:500, using an immunoperoxidase 
5 method of staining specific for murine antibodies as 

previously described (Murphy et al., Vaccine 8: 497-502 
(1990) , incorporated herein by reference in its entirety) . 

Recombinant virus derived from either positive or 
negative -sense cDNA were characterized by plac[ue assay at 

10 22^C, 37°C, 39*^C, and 40^C to determine if they were 

phenotypically similar to JS wt virus. Both positive and 
negative -sense rPIV were comparable to the JS wt virus in 
their level of replication at elevated temperatures of 39 *C 
and 40°C (Table 3) . This is , in contrast to the ts mutant JS 

15 cp45 which exhibits a 30-fold reduction in titer at 37®C and 
fails to produce plaques at 39^C or 40®C. 

TABLE 3 

20 The rJS Resembles its Biologically Derived Parent 

JS Wild-Type Virus in the Level of Replication at 
Restrictive Temperature (39®C - 40^C) 



30 







Virus 


Titer (log 


10 pfu/ml) 


Virus 


320C 


370c 


39<^C 


40*»C 


rJS-PS^ 


6.1 


6.1 


6.1 


6.6 


rJS-NS^ 


6.9 


7.1 


7.1 


7.0 


JScp45^ 


6.3 


4.3 


<0.7 


<0.7 


JS wt 


6.5 


6.8 


6.6 


6.7 



Recombinant virus derived 
clone p3/7(131)2G 

Recombinant virus derived 
p218 (131) 



from the antigenomic- sense 
from the genomic- sense clone 



45 



3 



JScp45 is a temperature sensitive mutant derived from JS 
wt . 
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The sequence of JS cp4 5 has been fully determined 
(Stokes et al . , supra . 1993) and mutations have been 
identified in the leader, N, P, M, F, HN, and L genes. 
However, it is unknown which mutation (s) are responsible for 
5 the ca, att, or ts phenotypes. Because exemplary rPIV of the 
invention demonstrate the ts* phenotype like the JS wt parent, 
cp45 mutations among other mutations known or yet to be 
discovered for PIV can be introduced, alone or in combination, 
into the full-length cDNA to pinpoint the effects of 

10 individual mutations or combinations thereof, e.g., by 

evaluating replication of the recombinant virus incorporating 
the mutation (s) of interest at elevated temperatures. The 
mutation (s) thus identified can be incorporated into 
effective, attenuated vaccine agents as described in the 

15 Examples below. These and other mutations incorporated into 

recombinant PIV can be further optimized by, e.g., mutagenesis 
to create two or more nucleotide substitutions per codon to 
render a recombinant virus that is more genetically stable 
than a biologically derived mutant strain. 

20 

EXAMPLE VIII 

Replication of rPIV in Hamsters 

25 Thirty-six 16 -week-old golden Syrian hamsters were 

divided into four groups of nine and inoculated intranasally 
with 0.1ml containing 10^-^ pfu of either rPIV recovered from 
negative -sense cDNA, rPIV recovered from positive-sense cDNA, 
JS cp45, or JS wt virus. On day 4, the hamsters were 

30 sacrificed and the lungs and nasal turbinates harvested. The 
lungs were homogenized in a 20% w/v L-15 suspension containing 
2.5/ig/ml amphotericin B (Quality Biologicals, Gaithersburg, 
MD) , 200/ig/ml pipericillin (Lederle Laboratories, Pearl River, 
NY) , and 50/xg/ml gentamicin (Quality Biologicals) . The nasal 

35 turbinates similarly were homogenized in a 10% w/v L-15 

suspension. After homogenization, the samples were aliquoted 
and rapidly frozen in a dry ice-ethanol bath. Virus present 
in the samples were titered at a later date in 96 well plates 
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of LLC-MK2 cells at 32 scoring CPE at five and seven days 
after inoculation. The mean log^o TCIDgo/gm was calculated for 
each group of nine hamsters . 

Table 4 illustrates that rPIV recovered from 
negative-sense cDNA, rPIV recovered from positive-sense cDNA 
replicate to substantially the same level as the JS wt in the 
upper and lower respiratory tract of hamsters. This is in 
contrast to the JS cp45 virus, which is attenuated at each 
site . 



TABLE 4 

The rJS Resembles its Biologically Derived Parent 
JS wt Virus in the Level of Replication in the 
Upper and Lower Respiratory Tract of Hamsters. 





Mean 


Virus Titer (log^o TCIDgo/g)^ 


Virus 


Nasal 


Turbinates Lungs 



rJS-PS^ 6.6±0.2 4.1±0.3 

rJS-NS^ 6.4±0.1 4.2+0.2 

JScp45^ 4.2±0.2 ^1.4+0.0 

JSwt* 6.3±0.2 4.6±0.3 

^ Recombinant virus recovered using p3/7(131)2G encoding 
the positive -sense HPIV3 antigenome. 

^ Recombinant virus recovered using p218(131) encoding the 
negative -sense HPIV3 genome, 

^ Biologically derived ts mutant. 

^ Biologically derived parent virus. 

^ Mean titers ± standard errors for nine hamsters per 
group . 



Thus, exemplary rPIVs of the invention can retain 
the replicative capacity in hamsters exhibited by the 
biologically derived JS wt parent strain, whereby mutations 
such as those present in the JS cp45 candidate vaccine that 
restrict replication in hamsters and other hosts, including 
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non-human primates and humans, can be identified and 
incorporated within modified rPIV strains of the invention, as 
described in the Examples below. 

5 

EXAMPLE IX 

Identification of Amino Acid Sxibstitutions in HPIV3 Specifying 
Attenuated Phenotvpes^ and Incorporation of Attenuating 
Mutations into Infections > Attenuated PIV Clones 

10 The ability to generate infectious PIV from cDNA 

facilitates development of live -attenuated parainfluenza virus 
vaccines. More specifically, by using the methods and tools 
disclosed herein the genetic basis of attenuation of PIV 
candidate vaccines can be readily determined, and infectious 

15 PIV vaccines produced from cDNA can be designed to achieve a 
finely calibrated level of attenuation and immunogenicity . 

In addition, the tools and methods of the invention 
provide for vaccine development for all three human 
parainfluenza viruses, HPIVl, HPIV2 and HPIV3 that are most 

20 important in human disease. For example, to produce and 

select effective HPIV3 vaccine agents within the invention, 
mutations associated with desired phenotypes of biologically 
deriving HPIV3 candidate vaccines or the attenuated BPIV3 
virus, e.g. attenuating mutations, can be identified and 

25 incorporated into rPIV. Applying these methods, attenuating 
mutations from a large menu of candidate mutations are 
selected and combined to generate rPIV having a desired 
balance between attenuation and immunogenicity, and which 
retain the attenuation phenotype following replication in 

30 humans. 

In the present example, the genetic bases of 
temperature -sensitive (te) and in vivo attenuation (att) 
phenotypes of the PIV3 JS cp45 live-attenuated virus are 
described. Seven exemplary recombinant PIV3 viruses (three 
35 single-, three double-, and one triple-lesioned virus) were 
recovered from full-length antigenomic cDNA and analyzed for 
their ts and att phenotypes. These recombinants bore one or 
more amino acid substitution mutations present in the L gene 
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of JS cp45 (alternatively referred to herein as cp45) , adopted 
within a cDNA clone of the JS wt parent. These three 
exemplary, biologically derived mutations are all present in a 
representative strain of JS cp45 grown in Vero cells, 
5 designated JS cp45 Vero, deposited on August 21, 1997 under 
the terms of the Budapest Treaty with the American Type 
Culture Collection (ATCC) of 123 01 Parklawn Drive, Rockville, 
Maryland 20852, U.S.A., and granted the accession number ATCC 
VR 2588. 

10 Analyses of exemplary PIV recombinants, presented 

below, demonstrate that each of the three exemplary mutations 
in L (Tyr942 to His, Leuggj to Phe, and Thrisgg to lie) contribute 
to the ts and att phenotypes of cp45 and are useful for 
generating of recombinant vaccine virus. 

15 

Viruses and cells. 

The PIV3 JS wt and cp45 viruses were grown in 
LLC-MK2 cells as described previously (Hall et al . , Virus Res . 
22:173-184 (1992), incorporated herein by reference). The 

20 VTF7-3 recombinant vaccinia virus is described in Fuerst et 
al., Virology 225: 419-422 (1996) and the modified vaccinia 
virus Ankara (MVA) which expresses the T7 polymerase is 
described in Wyatt et al., Virology 210:202-205 (1995) (each 
incorporated herein by reference) . HEp-2 (ATCC CCL 23) and 

25 LLC-iyiK2 (ATCC CCL 7.1) cells were maintained in OptiMEM (Life 
Technologies) supplemented with 2% PBS and gentamicin sulfate 
(50ug/mL) . 

Construction of point mutations in the L gene of 

PIV3. 

30 pUC19 was modified to accept a fragment of the JS wt 

PIV3 L gene in order to introduce point mutations into the L 
gene by site-directed mutagenesis. First, a unique Nhe 1 
restriction site was introduced into pUC19 by ligating a pair 
of complementary oligonucleotides (5' GATCGATGCTAGCCC 3' (SEQ 

35 ID NO: 23) and 5' GATCGGGCTAGCATC 3' (SEQ ID NO: 24)) 

containing an Nhe 1 restriction site into the Hind III site of 
pUC19 to create pUC19 (N) . The Sph 1 (PIV3 nt 11317) to Nhe I 
(PIV3 nt 14087) fragment of pTM(L) , which includes the 
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positions where the three coding changes in cp45 occur and 
which can be directly introduced into the full-length PIV3 
cDNA (see below) , was cloned into the Sph I and Whe I site of 
pUC19 (N) to create pUCL(N-S) . Point mutations were 
5 introduced into pUCL(N-S) using mutagenic oligonucleotides 

with the Transformer mutagenesis kit (Clontech, Palo Alto, CA) 
for the purpose of (i) creating exemplary amino acid 
substitutions at L protein positions 942, 992, and 1558, 
individually and in combination, and (ii) ablating one 
10 specific naturally-occurring restriction enzyme site proximal 
to each codon substitution as a marker [See Table 5] . 
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Mutations introduced in pUCL{N-S) derivatives were verified by 
dideoxynucleotide sequencing of plasmid DNA. The Sph I to 
BawRl (nt 13733) fragment of pUCL(N-S) containing the cp45 
individual L gene mutations was subcloned into the Sph I to 
5 BaniRl sites of pTM(L) to give p™(L)-942, -992, -942/992, and 
-1558; the other double and triple mutations were assembled 
using the Pin AI and Nhe I sites (Fig. 12) . The mutant pTM(L) 
plasmids were each tested at permissive temperature (32°C) for 
the ability to direct the expression of the chloramphenicol 

10 acetyl transferase marker gene in a minireplicon system 

comprising a plasmid- encoded minigenome RNA and the N, P and L 
proteins (Durbin et al., Virolocrv 234:74-78 (1997), 
incorporated herein by reference) . The various mutant L 
plasmids supported marker gene expression to 75-106% the level 

15 of wt L, indicating that each engineered cDNA was free of 

significant spurious mutation (not shown) . The Sph I to Nhe I 
fragments of each of the mutant pTM{L) plasmids were then 
subcloned into the Sph I to Nhe I window of the full-length 
PIV3 JS antigenomic cDNA p3/7(131)2G to create seven 

20 full-length PIV3 cDNA clones representing every possible 
combination of the three substitutions. 

Recovery o£ recoxnblnant PIV3 (rPIVB) bearing one, 
two or three cp45 L protein substitutions. 

Each full-length antigenomic cDNA bearing one or 

25 more cp4 5 L gene mutations, together with the three support 

plasmids pTM(N), pTM(P) and pTM(L), was transfected into HEp-2 
cells on 6-well plates (Costar, Cambridge, MA) using 
LipofectACE (Life Technologies) and MVA-T7 as described above. 
After incubation at 32*^0 for 4 days, the transfection harvest 

30 was passaged onto HEp-2 cells on 6-well plates which were 

incubated at 32°C for 4 days. Each passage 1 supernatant was 
harvested and passed onto a T-25 flask of LLC-MK2 cells, which 
was incubated at 32®C for 5-6 days. The passage 2 supernatant 
was harvested and the presence of recombinant virus was 

35 initially confirmed by immunoperoxidase staining of virus 

plaques (Murphy et al . , Vaccine 8:497-502 (1990), incorporated 
herein by reference) with anti-HN monoclonal antibody (Mab) 
77/5, which binds to both biologically derived and recombinant 
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JS PIV3, and Mab 423/6, which does not bind to cDNA-derived 
virus because its epitope was ablated to serve as a marker. 
Virus present in passage 1 was subjected to two or three 
rounds of plaque purification on LLC-MK2 cells as described 
5 previously. Each biologically cloned recombinant virus was 

amplified twice in LLC-MK2 cells at 32®C to produce virus for 
further characterization. Virus was concentrated from 
clarified medium by polyethylene glycol precipitation, and 
viral RNA (vRNA) was extracted with Trizol Reagent (Life 
10 Technologies) . Reverse transcription (RT) was performed on 

vRNA using the Superscript II kit with random hexamer primers 
(Life Technologies) . The Advantage cDNA PGR kit (Clontech, 
Palo Alto, CA) and sense (5' nt 

11190 -GCATTATCTAGATGTGTCTTCTGGTCAGAG 3' nt -11219) (SEQ ID NO: 

15 31) and antisense (5' nt 14140-CCTGAATTATAATAATTAACTGCAGGTCCT 
3' nt-14111) (SEQ ID NO: 32) primers specific for the PIV3 L 
gene were used to amplify the region spanning the Sph 1 to Nhe 
I fragment. The PGR fragments were analyzed by digestion with 
each of the restriction enzymes whose recognition sites had 

20 been ablated during insertion of the three cp45 mutations in L 
(see Table 5) . 

Efficiency of plaque forxnation (EOP) at permissive 
and restrictive temperatures of rPIVB bearing one, two or 
three cp45 L protein amino acid substitutions. 

25 The level of temperature sensitivity of plaque 

formation in vitro of control and recombinant viruses was 
determined at 32^C, 37°C, 38^C, 39°C, 40^C, and 41«C in LLC-MK2 
monolayer cultures and plaques were enumerated by 
hemadsorption with guinea pig red blood cells following 

30 removal of the methylcellulose overlay. Alternatively, viral 
plaques present in the monolayer were identified by 
immunoperoxidase staining with a mixture of two PIV3 -specific 
anti-HN murine xtAha 101/1 and 454/11 diluted 1:500, (Murphy et 
al., suprc^, (1990)) . 

35 Hamster studies* 

4 to 16 week-old golden Syrian hamsters in groups of 
six were inoculated intranasally with 0.1ml OptiMEMl per 
animal containing 10^*^ pfu of rPIV3 JS wt, PIV3 cp45 virus, or 
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one of the rPIV3 containing one or more cp45 L protein 
substitution (s) . On day 4 post -infection, the hamsters were 
sacrificed, the lungs and nasal turbinates were harvested, and 
the virus was quantified as described above. The mean log^o 
5 TCIDso/g was calculated for each group of six hamsters . 

RESULTS 

Introduction of the PIV3 cp45 L protein amino acid 
substitutions into wt JS rPIV3 . 

As noted above, the three amino acid substitutions 

10 present in the L protein of cp45 (Table 5) were introduced 

individually or in selected combinations into the antigenomic 
cDNA that encodes its wt parent, PIV3 JS strain. Each 
introduced mutation was engineered to be marked with a silent 
mutation that ablated a proximal, naturally-occurring 

15 restriction enzyme site to facilitate monitoring the mutation 
in recovered rPIVS (Table 5, Fig. 12) . The coding change at 
amino acid 1558 was designed to contain two nucleotide changes 
in rl558, compared to the one nt substitution in cp45, to 
reduce the chance of reversion at this site during in vitro or 

20 in vivo replication. 

Seven rPIV3s bearing one, two or all three of the 
amino acid substitutions from cp45 were recovered in tissue 
culture by transfection of each antigenomic cDNA together with 
the pTM(N), pTM(P) and pTM(L) support plasmids and coinfection 

25 with the vaccinia virus MVA/T7 pol recombinant (Wyatt et al . , 
supra . (1995) ) . Each rPIV3 possessed the Mab resistance marker 
that had been deliberately introduced into the HN gene by 
engineering the antigenomic cDNA. The rPIV3s were 
biologically cloned by two or three cycles of plaque to plaque 

30 passage to ensure that each virus preparation was genetically 
homogeneous. This precaution was taken because vaccinia virus 
can meditate recombination between the antigenomic cDNA and 
the support plasmids. 

To confirm that each of the seven rPIV3 contained 

35 the engineered mutation (s) in the L gene, RNA was purified 
from precipitated virions and was copied into cDNA and 
amplified by RT-PCR. Control reactions showed that the RT 
step was required for generation of RT-PCR products. 
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indicating that an RNA template rather than contaminating cDNA 
was required for the generation of the RT-PCR product. The 
RT-PCR products were subjected to digestion with the three 
restriction enzymes whose recognition sequences had been 
5 ablated as markers for the inserted coding changes . As 

expected, the RT-PCR product of JS wt rPIV3 was cleaved the 
appropriate number of times by each of the three enzymes, 
whereas r942/992/1558 lacked each of the three sites ablated 
during creation of the individual cp45 coding changes. Each 

10 of the other rPIV3s lacked the appropriate restriction 

site(s), indicating the presence of the introduced mutations. 

E££iciency of plaque formation at 32^C, 37^C, 38^C, 
39^C, 40^C, and 41*'C of rPIV3 bearing cp45 L mutations. 

The seven rPIV3s bearing the various combinations 

15 ofcp45 L protein amino acid svxbsti tut ions were assayed for 

their ability to form plaques on LLC-MK2 monolayers at 32°C, 
37<^C, 38^C, 39°C, 40°C and 41°C. As shown in Table 6, each 
rPIVB bearing a cp45 aa substitution was ts, whereas the JS wt 
rPIV3 parent was not restricted in plaque formation at any 

20 temperature tested. The shut-off temperature of plaque 

formation of r942, r992 and rl558 was AO^C. r942 manifested a 
700-fold reduction of plaque formation at 40®C, indicating 
that its replication was marginally reduced at this 
restrictive temperature. However, the plaque size of r942 

25 also was greatly reduced at 40^C, which also indicates that 

its replication was restricted at this temperature compared to 
the JS wt. r942 was completely restricted in replication at 
41 °C (data not shown) . r992 and rl558 were greatly reduced 
(over a 1,000,000-fold reduction) in plaque formation at 40°C. 

30 These results indicate that each of the three cp45 L gene 

mutations individually specifies the ts phenotype, although 
that of the r942 mutation is somewhat less restrictive. The 
double mutant virus r942/1558 and the triple mutant 
r942/992/1558 had a shut-off temperature of 39*^0, while that 

35 of r942/992 and cp45 was 38«C. The double mutant, r992/1558, 
was less ts than the r992 single mutant. r992/l558, like 
r942, was completely restricted in plaque formation at 41°C. 
The level of temperature sensitivity exhibited by double or 
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triple results from a rPIVs delicate interplay of the three 
mutations that cannot be predicted from level of temperature- 
sensitivity exhibited by the single mutants. Also, since 
r942/992/l558 was slightly less ts than cp45, other mutations 
outside of the L gene also likely contribute to the ts 
phenotype of cp4 5, therefore representing additional mutations 
of interest within the invention. 

Table 6: The efficiency of plaque formation (EOP) at 32**C, 
37^C, 38*>C, 39«C and 40°C of rPIV3 bearing one, two or three 
cp^S L protein amino acid substitutions. 



virus® 


pf u/ml ) 




Virus 


titer (logio 




32*>C 


37*>C 


38°C 


39*>C 


40OC 


r942 


6.8 


6.8^ 


6.6^ 


6.5*^ 


4.3^ 


r992 


6.9 


6.8** 


6.7^ 


6.1^ 


<0.7 


rl558 


6.6^ 


6.6** 


6.4^ 


5.0»^ 


<0.7 


r942/992 


6.7 


6.5*> 


4^^ 


3.0^ 


<0.7 


r942/l558 


5.2 




4.0*^ 


1.0^ 


<0.7 


r992/l558 


6.7 


6.7^ 


6.5^ 


5.9^ 




r942/992/1558 


6.6 


6.2^ 


6.2^ 


2.7^ 


<0.7 


cp45*' 


6.6 


4.8 


4^^ 


<0.7 


<0.7 


rPiV3 JS 


8.3 


8.4 


8.5 


8.5 


8.3 



a. The cp45 virus is a biologically derived virus, and each 
of the other viruses tested is a recombinant. 

b. Plaques were of pinpoint size. 

c. Underlined numbers represent the shut-off temperature of 
plaque formation, which is defined as the lowest restrictive 
temperature at which a 100-fold reduction in titer is 
observed compared to the titer at 32**C. 



Growth in hamsters. Groups of six Golden Syrian 
hamsters were inoculated intranasally with JS wt rPIV3, 
biologically-derived cp45, or with rPIV3 containing one or 
more cp45 L protein amino acid substitutions, and virus 
replication in the lungs and nasal turbinates was determined. 
In this experiment [Table 7] , each of the rPIV3s bearing a 
single amino acid substitution was restricted in replication 
in the upper and lower respiratory tract [Table 7] . However, 
r942, the least ts virus, was only marginally suppressed in 
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replication in the upper and lower respiratory tract in a 
second experiment . These data demonstrate that two of the 
three amino acid substitutions contribute to the att phenotype 
when present as single-lesioned recombinant viruses. However, 
5 the 942 mutation indeed contributes to attenuation (e.g., the 
r942/992 is more attenuated than r992 alone) . Thus, each of 
the amino acid substitutions in L contribute to the att 
phenotype either acting alone or in concert with another L 
amino acid mutation. Each of the double mutants was 

10 attenuated indicating that loss of any of the three L gene 
substitutions following replication in vivo still leaves an 
attenuated virus. This is a partial explanation for the 
previously observed high level of stability of the ts 
phenotype of the cp45 following replication in vivo . The 

15 triple mutant r942/992/l558 was as restricted as cp45 for 
replication in the upper and lower respiratory tract 
indicating that the three amino acid substitutions in the L 
protein are the major contributors to the att phenotype of 
cp45. 
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Table 7: The level of replication in the upper and lower 
respiratory tract of hamsters of rPIV3 bearing one, two or 
three cp45 L protein amino acid substitutions, compared to JS 
wt rPIV3 and cp45^ 



Virus 


Mean virus titer 


(logio TCIDgo/g ± S.E°) 




Nasal turbinates 


Lungs 


rFivj wt 


7.4 ± .16 


5.1 ± .49 


r942 

X ^ ^ A 


M 17 
O • D X • -i- / 




r992 


4.4 ± .16 


3.1 ± .11 


rl558 


3.8 ± .40 


4.3 ± .34 


r942/992 


<1.5 ± 0 


<1.5 ± 0 


r942/l558 


2.9 ± .23 


1.8 ± .17 


r992/l558 


5.7 ± .16 


3.2 ± .57 


r942/992/l5 
58 


3.9 ± .15 


<1.5 ± 0 


cp45 


4.1 ± .27 


1.6 ± .08 



Groups of six hamsters each were intranasally 
administered 10^-^ pfu of virus per animal in 



20 an 0.1ml inoculum, and lungs and nasal 

turbinates were harvested four days later. 

b. Standard Error 

c. cp45 is a biologically derived virus and the 
others are recombinant . 

25 

To summarize the above results, substitutions at L 
protein amino acid positions 992 and 1558 each specified a 
1,000,000-fold reduction in plaque formation in cell culture 
at 40®C, while the substitution at position 942 specified a 

30 700-fold reduction. Thus, each of the three mutations 

individually contributes to the ts phenotype. The triple 
recombinant which possesses all three L mutations is slightly 
less ts than cp45, suggesting that there are mutations outside 
of the L gene in cp45 that also might contribute to its ts 

35 phenotype. Two of the three individual mutations in L each 
contributed to restricted replication in the upper or lower 
respiratory tract of hamsters, which accounts for the observed 
stability of ts and att phenotypes of cp45 during replication 
in vivo. Importantly, the level of temperature sensitivity of 

40 recombinant vaccine strains in vitro was closely predictive of 
attenuation in vivo. The recombinant virus possessing all 
three mutations was as restricted in replication as the cp45 
mutant in both the upper and lower respiratory tract of 
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hamsters, indicating that the L gene of the cp45 virus is a 
major attenuating component of this candidate vaccine strain. 
While each mutation on its own specifies the ts phenotype, 
when placed together they are not simply additive but instead 
5 somehow influence each other. The effect of the three 

mutations together in the triple mutant seemed to ameliorate 
rather than enhance the level of temperature-sensitivity 
observed in the two double mutants which were evaluated. 
Interestingly, this should provide an unanticipated selective 

10 pressure to maintain at least some of the cp45 L mutations, 
since the loss by reversion of either the 992 or 1558 
substitution would increase rather than decrease the level of 
temperature sensitivity. Considered together, these findings 
indicate that the high level of the stability of the ts and 

15 att phenotypes of cp45 virus results from the contribution ,pf 
multiple ts mutations in L to the att phenotype. The 
identification of these three mutations as the major 
attenuating mutations of cp45 provides the basis for 
monitoring virus during all stages of manufacture and 

20 following replication in humans. 

It is of further interest that the tyrosine to 
histidine mutation at position 942, arguably the most 
conservative substitution of the three mutations, was the 
least temperature sensitive. The L polymerase of PIV3 is a 

25 large polypeptide, 2233 aa in length, and is thought to be a 
multifunctional protein that encodes multiple domains 
including those required for complex formation with the P 
protein, RNA binding, RNA polyadenylation, RNA transcription 
and RNA replication (Collins et al., supra . (1996)). The 

30 amino acid substitutions in L at positions 942 and 992 are 
located near regions that are well -conserved among other 
members of the Paramyxovirus family (Blumberg et al , , Virology 
164:487-497 (1982); Galinski et al . , Virolocrv 165:499-510 
(1988) ) . The mutation at position 1558 is in a region of the 

35 polymerase which appears to share less sequence identity with 
other L polymerases. Although the mechanism by which the ts 
phenotype is conferred by the triple amino acid substitution 
in L is not known, it is likely that multiple L protein 
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domains and activities are affected, or that a common 
mechanism involving various activities of L is affected. 



EXAMPLE X 

5 Direct Identification, and Reconstitution into Recombinant 

Vaccine Viruses, of Mutations in a BioloaicallY Derived, 
Live-Attenuated HPIV Type 3 Virus (cp45) Which Specify the 
Temperature- Sensitive, Cold-Adaptation and Attenuation 

Pheno types 

10 The above Examples demonstrate that each of the 

three amino acid substitutions in the L polymerase protein of 
cp45 confer the temperature-sensitive (ts) and attenuation 
(att) phenotypes, but not the cold-adaptation (ca) phenotype 
(see also, Skiadopoulos et al., J, Virol 72 (3) : 1762-8 . 1998). 

15 cp45 contains twelve additional mutations in other proteins 
(N, C, M, F and HN) or potential cis-acting sequences (the 
leader region and the transcription gene start {GS} signal of 
the N gene) , and their contribution to these phenotypes has 
been heretofore undefined. The present Example further 

20 characterizes the genetic basis for the ts, ca, and att 
phenotypes of cp45 to provide yet additional information 
regarding basis for the observed high level of stability of 
these phenotypes following replication of cp45 in humans or 
non-human primates. In one aspect of this study, a 

25 recombinant cp45 (rcp45) virus containing all fifteen 

cp45- specific mutations was constructed, using a reverse 
genetics system, and was found to be essentially 
indistinguishable from the biologically-derived virus on the 
basis of plaque size, level of temperature-sensitivity, 

30 cold- adaptation, and level of replication in the upper and 

lower respiratory tract of hamsters. In addition, recombinant 
viruses containing: (1) the cp45-specif ic changes in the C, M, 
F or HN proteins, (2) the combined leader and N gene 
mutations, or (3) several combinations of the cp45 mutations 

35 were constructed. Analysis of these recombinant viruses 

showed that multiple cp45 mutations distributed throughout the 
genome contribute to the ts, ca, and att phenotypes. The 
mutations in C and F were not ts at 40°C but nonetheless 
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conferred the att phenotype, and they, therefore, are non-ts 
att mutations. The HN mutation did not confer the ca, ts or 
att phenotypes. Viruses possessing the 3" leader and N 
mutations were ts, but exhibited only marginal attenuation in 
5 the lower respiratory tract of hamsters. Recombinants 

possessing several combinations of mutations exhibited a 
greater level of temperature sensitivity than cpAS, but the 
increased level of temperature-sensitivity was not reflected 
in an increase in attenuation in vivo. These latter findings 

10 indicate that the multiple mutations identified in cp45 are 
interacting to affect replication in vitro. The presence of 
multiple ts and non-ts attenuating mutations in cp45 likely 
contributes to its high level of attenuation and phenotypic 
stability. Knowledge of the phenotypes associated with the 

15 various mutations of cp45 provided herein allows for accurate 
monitoring of biologically derived PIV viruses and ready 
manipulation of recombinant virus to achieve a large 
assemblage of useful vaccine recombinants within the 
invention. 

20 Viruses and cells. 

The rPIV3s, PIV3 JS wt and cp45 viruses described in 
the present Example were grown in simian LLC-MK2 cells (ATCC 
CCL 7.1) as described above (see also, Durbin et al . , Virology 
235:323-332, 1997a; Hall et al . , Virus Res. 22 (3) ; 173-184, 

25 1992; Skiadopoulos et al . , J. Virol 72 (3) :1762-8. 1998). The 
modified vaccinia virus Ankara was provided as described 
above. HEp-2 (ATCC CCL 23) and LLC-MK2 cells were maintained 
in OptiMEM I (Life Technologies, Gaithersburg, MD) 
supplemented with 2% FBS and gentamicin sulfate (50ug/mL) , or 

30 in EMEM supplemented with 10% FBS, gentamicin sulfate 

(50ug/mL) , and 2raM glutamine. L-132 cells (ATCC CCL 5) were 
grown in Earl's MEM (Life Technologies) supplemented with 10% 
FBS, 2mM glutamine, 20 mM Hepes, 1 mM non-essential amino 
acids, and 100 units streptomycin-neomycin/ml . 

35 Construction of point mutations in PIV3 • 

Subgenomic fragments of p3/7(131)2G, the antigenomic 
cDNA clone of PIV3 JS wt used above to recover infectious 
virus (see also, Durbin et al., Viroloav 235 :323-332. 1997a; 
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Skiadopoulos et al., J, Virol. 72 (3) :l762-8. 1998), were 
cloned into pUC19 vectors modified to accept these fragments, 
using standard molecular cloning techniques. Point mutations 
corresponding to mutations identified in cp45, as well as 
mutations creating or ablating silent restriction enzyme 
recognition sequences (Table 8) were introduced using the 
Transformer Mutagenesis Kit (Clontech, Palo Alto, CA) as 
described previously. 



TgUt>le 8: Pry3 cp45 mutations introduced into rPrV3 





seq 

id 

no 


region 
a££ect 
ed 


nt 

posit 
ion* 


sequence changes" 


res trie 

tion 
marker^ 


amino 
acid 
substitu 
tion^ 


1 


46 
47 


3- 
leader 


20 


TTGTCTGGGAAT 
TTGCCTGGGAAT 


none 


non- 
coding 


2 


46 
48 


3' 
leader 


20 


TTGTCTGGGAAT 
TTGTTTGGGAAT 


none 


non- 
coding 


3 


46 
49 


3' 
leader 


20 


TTGTCTGGGAAT 
TTGTCTGGTAAT 


none 


non- 
coding 


4« 


50 
51 


3' 
leader 


40 


AACTTTAAATTA 
AACTTTVAAATTA 


-Dra 1 


non- 
coding 


5« 


52 
53 


N 
gene 
start 


60 


TTAAAGACATTG 
TTTAAGACATTG 


none 


non- 
coding 


6« 


54 
55 


H 


390 


GCAGATGTCAAG 
GCAGATGCCAAG 


none 


Val -96 
to Ala 


7° 


56 
57 


N 


1271 


CGAATCTAAAGA 
CGAAGCTAAAGA 


none 


Ser-389 
to Ala 


8 


58 
59 


C 


2076 


GAAATATTOATC 
GAAACATTGATC 


-Ssp I 


Ile-96 
to Thr 


9 


60 
61 


H 


4341 


TCTCTACCCAAC 
TCGTTAACCAAC 


+Hpa I 


Pro-199 
to Thr 




62 
63 


P 


6323 


AGTACAATAGGT 
AGTACTGTGGGT 


+Sca I 


Ile-420 
to Val 


11 


64 
65 


P 


6419 


GCACTTGATCCA 
ACACTGGATCCA 


+Bain HI 


Ala-450 
to Thr 


12 


66 
67 


HN 


7944 


CCATCATTGTTGTTGACAA 
CCATCATTG TOGCTGACAA 


+Bst XI 


Val-384 
to Ala 


13 


68 
69 


L 


11468 


TTACA TGGCCA 
TCACATGGCGA 


-Eae I 


Tyr-942 
to His 


14 


70 
71 


L 


11618 


TTTGGACTGGGC 
TTTTGATTGGGC 


-Bsr I 


Leu- 992 
to Phe 


■ 15' ■ 


72 
73 


L 


13308 


GGTCCTAATACT 
GGGCCTAATATC 


-Ava II 


Thr- 1558 
to lie 



a. Position of the first nucleotide in the PXV3 sequence shown, b. 
Wild type sequence is shown above the mutant sequence. Nucleotide 
changes are underlined. Codon substitutions are in bold font. c. 
Each relevant restriction endonuclease recognition sequence is in 
italics; (+) indicates addition of new restriction endonuclease 
recognition sequence; {-) indicates ablation of a naturally occurring 
restriction endonuclease recognition sequence, d. Mutations are 
indicated as the three letter amino acid assignment of wt, followed 
by the amino acid position, followed by the cp45 assignment. e. 
These mutations were identified by Joanne Tatem (unpublished 
observations), the others were from Stokes et al . , Virus Res. 
30(1) :43-52, 1993. f. Two nucleotides were changed in the indicated 
codon in order to reduce the chance of reversion to wild type 
sequence . 
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After mutagenesis, restriction endonuclease 



fragments were sequenced completely and cloned back into the 
full-length clone, p3/7(131)2G. The 3' leader and N mutations 
were amplified by reverse transcription (RT) -PGR directly from 
5 PIV3 cp45 virion RNA and were cloned into pLeft+2G (see 

above) , or a modified pUC19 vector for further manipulation. 
Combinations of mutations were constructed using standard 
molecular cloning techniques. The full-length plasmid clone 
containing the cp45 mutations, designated pFLCcp45, was 
10 completely sequenced to determine if extraneous mutations had 
been introduced during the cloning process, but none were 
found. 



15 mutations, together with the three support plasmids pTM(N), 
pTM(P no C) and pTM(L) , was transfected into HEp-2 cells on 
6-well plates (Costar, Cambridge, MA) using LipofectACE (Life 
Technologies, Gaithersburg, MD) and MVA-T7 as described above. 
After incubation at 32°C for 4 days, the transfection harvest 

20 was passaged onto LLC-MK2 cells in T-25 flasks which were 
incubated at 32°C for four to eight days. This harvested 
virus was called passage 1 and was subjected to three rounds 
of plaque purification on LLC-MK2 cells as described above. 
Each biologically- cloned recombinant virus was amplified twice 

25 in LLC-MK2 cells at 22^C to produce virus for further 

characterization. Virus was concentrated from clarified 
medium by polyethylene glycol precipitation (see, Mbiguino and 
Menezes, J. Virol Methods 31 :2-3, 1991, incorporated herein by 
reference in its entirety) , and viral RNA (vRNA) was extracted 

30 with Trizol Reagent (Life Technologies) . Reverse 

transcription was performed on vRNA using the Superscript II 
Preamplif ication System (Life Technologies) with random 
hexamer primers. The Advantage cDNA PGR kit (Clontech) and 
sense and antisense primers specific for various portions of 

35 the PIV3 genome were used to amplify fragments for restriction 
endonuclease analysis. The PGR fragments were analyzed by 
digestion with each of the restriction enzymes whose 



Recovery of recombinant PIV3 (rPIVS) . 

Each full-length antigenomic cDNA bearing cp45 
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recognition sites had been created or ablated during 
construction of the mutations (Table 8) . 

Efficiency of plaque formation (EOP) of rPIV3 
bearing cp45 mutations at perxoissive and restrictive 
5 temperatures. 

The level of temperature-sensitivity of plaque 
formation in vitro of control and recombinant viruses was 
determined at 32"C, 35°C, 36°C, 37^C, 28^C, 39**C, 40°C, and 41°C 
in LLC-MK2 monolayer cultures as described above (see also, 

10 Hall et al., Virus Res. 22 (3) :173-184 . 1992, incorporated 

herein by reference in its entirety) . Plaques were enumerated 
by hemadsorption with guinea pig red blood cells following 
removal of the methylcellulose overlay, or alternatively, the 
viral plaques present in the monolayer were identified by 

15 immunoperoxidase staining with a mixture of' two PIV3 -specif ic 
anti-HN murine mAbs lOl/l and 454/11 diluted 1:250 (see, 
Murphy et al . , Vaccine 8 (5) : 497-502 . 1990; van Wyke Coelingh 
et al., Viroloov 143 (2) : 569-582 . 1985). 

Evaluation of rPIV3 mutant viruses for 

20 cold- adaptation phenotype. 

Growth of mutant and wt rPIV3 viruses was determined 
at 32°C and 20*^C on confluent L-132 cell monolayers prepared in 
24-'Well tissue culture plates. Duplicate wells of each of two 
plates were inoculated with 0.2 ml of each mutant or wt rPIV3 

25 virus at a multiplicity of infection of 0.01. After one hour 
adsorption at room temperature, the inoculum was aspirated and 
the monolayers were washed with 1 ml of PBS per well. The 
inoculated cultures were overlaid with 0.5 ml of Earl's MEM 
supplemented with 10% FBS, 2mM glutamine, 20 mM Hepes, 1 mM 

30 non-essential amino acids, and 100 units 

streptomycin-neomycin/ral. One plate was sealed in a 
waterproof pouch (Kapak) and then submerged in a 20°C bath for 
13 days. The duplicate plate was placed at 32*^0 in a COj 
incubator for 3 days. At the end of the incubation period, 

35 virus was harvested by freeze/thawing. The titer of virus 
recovered from each well was determined by plaque assay in 
LLC-MK2 cells at 32°C using hemadsorption with guinea pig red 
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blood cells to visualize plaques. Two wt and two cp45 
reference stocks were used as controls. 
Hcunster studies* 

5 week-old Golden Syrian hamsters in groups of five 
5 were inoculated intranasally with 0 . 1 ml OptiMEM I per animal 
containing 10^° pfu of JS wt rPIV3, PIV3 cp45 virus, or one 
of the mutant rPIV3s. On day 4 post-infection, the hamsters 
were sacrificed, the lungs and nasal turbinates were 
harvested, and the virus was quantified as described above. 
10 The mean logioTCIDgo/gram at 32°C was calculated for each group 
of hamsters. 



RESULTS 

Introduction of the PIV3 cp45 mutations into JS wt rPIV3 . 

15 The fifteen mutations in the 3" leader, the N GS 

signal, and the N, C, M, F, HN and L proteins of cp45 (Table 
8) were introduced into the complete PIV3 antigenomic cDNA by 
site directed mutagenesis or by direct PGR amplification of a 
segment of cp45 cDNA bearing the desired mutations. The 

20 following antigenomic cDNAs were made (see Fig. 13) : (i) rcp45 
3'N, containing the four point mutations of the leader region, 
the point mutation in the N GS signal, and the two amino acid 
changes in the N protein; (ii) rcp45 C, containing the single 
amino acid change in the C protein; (iii) rcp45 M, containing 

25 the single amino acid change in M, (iv) rcp45 F, containing 

the two amino acid changes in F; (v) rcp45 HN, containing the 
single amino acid change in HN; (vi) rcp45 L, containing the 
three amino acid changes in L, as described above; (vii) rcp45 
3''NL, containing the mutations from i. and vi . above; (viii) 

30 rcp45 3'NCMFHN, containing all of the mutations except for the 
three in L; and (ix) rcp45, containing all fifteen cp45 
mutations listed in Table 8. In most cases, each cp45 change 
was engineered to be accompanied by one or more nearby silent 
changes which introduced or removed a restriction enzyme 

35 recognition site (Table 8) . These served as markers to 
confirm the presence of the mutation in the cDNA and in 
recovered virus. Also, two of the amino acid coding changes 
(mutation numbers 10 and 15 in Table 8) were made using two 
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nucleotide changes rather than the single change found in 
cp45, reducing the possibility of same-site reversion to wt. 
The cp45 cDNA, which contains all fifteen of the cp45 changes 
in Table 8, was assembled from the same mutagenized cDNA 
5 subclones as were used to introduce cp4 5 changes into the 

other antigenomic cDNAs; it was sequenced in its entirety, and 
was confirmed to contain only the intended mutations. This 
indicated that all of the regions which had been subjected to 
mutagenesis or PGR and had been manipulated by cloning 

10 possessed the desired sequences and lacked other unwanted 

mutations. Each full-length plasmid bearing one or more of 
the cp45 mutations was transfected into HEp-2 cells along with 
support plasmids and MVA-T7 to produce recombinant PIV3 as 
described above. Analysis of RT-PCR fragments encompassing 

15 the mutations indicated in Table 8 were amplified from virion 
RNA of the various recombinant viruses indicated in Figure 13 
confirmed the presence of the introduced mutations, and other 
unintended mutations were not found. 
Plaque morphology. 

20 Several of the recombinant viruses exhibited 

distinctive plaque morphologies when tested on LLC-MK2 cells. 
JS wt rPIV3 plaques averaged 1mm in size, and were 
indistinguishable in size from the biologically derived JS wt 
PIV3. Plaques of the cp45 and rcp45 viruses were larger than 

25 wt, averaging two- to three-fold larger in diameter than wt, 

and were indistinguishable from each other. This demonstrated 
the comparability of the biologically-derived and recombinant 
cp45 virus for this phenotype. 

Efficiency of plaque fozrmation of rPIV3s bearing the 

30 cp45 mutations in LLC-MK2 cells at permissive and restrictive 
temperatures • 

The rPIV3s were assayed for their ability to form 
plaques at permissive and restrictive temperatures ranging 
from 32**C to 41°C (Table 9) . Analysis of the ts phenotypes of 

35 viruses bearing individual components of cp45 revealed that 

the rcp45 S'N and rcp45 M viruses had a shutoff temperature of 
40°C, and the rcp45 C mutant had a shutoff of 41°C. The 
shutoff temperature of rcp45 F and rcp45 HN mutants was 
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greater than 41°C, Consistent with the above results, the 
rcp4 5 L virus had a shutoff temperature of 39°C. A virus is 
considered to have a ts phenotype, for example, if its 
reduction of replication at 40°C (ie. titer at 32*^C minus titer 
at 40°C) is approximately > 100-fold that of wt virus at 40°C. 
Applying this definition, the present results indicated that 
at least two regions of cp45 and L) contribute to the ts 

phenotype . 
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As shovm in Table 9, rcpAS, containing all of the cp45 
mutations, had a shut-off temperature of 38°C, which was 
identical to that of the biologically-derived cp45. These 
results show that the ts phenotype of cp4 5 was successfully 
reproduced in rcp45. In addition, these results validate the 
sequence analysis of cp45 and the subsequent reconstruction of 
mutations into recombinant virus. 



rcp45 except that it lacks the three L mutations, exhibited a 
shutoff temperature of 36°C. Since the L mutations are known 
to confer temperature-sensitivity individually and in 
combination, it is somewhat paradoxical that rcp45 3'NCMFHN 
was more, rather than less, ts than cp45. This implies that 
there is an interaction of mutations within cp45 whereby 
mutations compensate for each other to give a level of 
temperature- sensitivity which is less than the sum of the 
individual components. 



whether the L mutations interact with the leader and/or N 
mutations, since all of these elements are believed to 
interact during RNA synthesis. This virus had a shutoff 
temperature 36^C , compared to 40°C and 39°C for rcp45 S'N and 
rcp45 L, respectively. These results suggest that there is an 
interaction between the 3'N and L mutations that results in 
augmentation of temperature-sensitivity. These results also 
provide another example in which a subset of cp45 mutations 
specifies a level of temperature sensitivity greater than that 
observed for rcp45 which contains the entire set of mutations 

ca phenotype of rPIVBs bearing cp45 mutations. 

The rPIV3s were analyzed to determine which genetic 
elements of cp45 specified the ca phenotype (Table 10) . It 
was demonstrated above that rcp45 L is ts and att, but not ca. 
This indicates that the genetic elements specifying the 
greater part of the ca phenotype are located outside L, and 
this was confirmed in the present study. Each of the rPIVs 
possessing the 3' leader and N mutations were ca except rcp45 



The rcp45 3"NCMFHN virus, which is identical to 



Virus rcp45 S^NL was constructed to investigate 
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3''NCMFHN, which exhibited an intermediate phenotype. This 
shows that the ca phenotype is specified mostly within the 3^N 
region. The finding that the level of ca of viruses 
containing the 3"N segment is less than that of cp45 or rcp45 
5 indicates that other regions of cp45 contribute to the ca 

phenotype, even though this is not apparent from analysis of 
the other regions individually. The rcp45 3'NL virus is more 
ca than the rcp45 3"N virus, suggesting that the L mutations 
may make a modest contribution. The biologically-derived cp45 

10 and rcp45 viruses exhibit comparable levels of ca, indicating 

that this phenotype, like the plaque size and ts phenotypes, 
was successfully reproduced in the recombinant cp45 virus 
provided herein. Therefore, the ca phenotype, like the ts 
phenotype, is a composite phenotype reflecting individual 

15 contributions to the overall phenotype as well as 

contributions from interacting genetic elements. 

20 Table 10 : Growth o£ wt and mutant PIV38 at 20°C 

compared to 32°C* (cold adaptation phenotype) . 



Virus 20®C 32^C ca Phenotype 



cp45^ 


6 . 72 


8.49 




rcp45 


6.57 


8.35 


+ 


rcp45 3"NL 


5.02 


7.23 


+ 


rcp45 3"N 


4.39 


8.53 


+ 


rcp45 3'NCMFHN 


3.53 


8.08 


+/- 


rcp45 L 


3.18 


7.98 




rcp45 C 


2.52 


8.32 




rcp45 F 


2.47 


8.10 




rcp45 HN 


2.11 


8.01 




rcp45 M 


1.76 


8.21 




PIV3 WT 


2.57 


8.43 





35 a. Virus titer is expressed in logjo PFU/mL. b. cp45 is 

biologically-derived and the other viruses are 
recombinant. The ca phenotype is defined as a greater 
than 10-fold increase in growth at 20°C relative to wt. 

Growth of the rcp45 mutant viruses in hamsters. 

Groups of five Golden Syrian hamsters were 
inoculated intranasally with 10^ TCID50 of recombinant or 
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biologically-derived virus, and the level of virus replication 
in the lungs and nasal turbinates was determined four days 
later (Table 11) . The fourth day post -inoculation has been 
shovm previously to be the peak of virus replication in 
hamsters for both the wt and cp45 viruses (see, Crookshanks 
and Belshe, J. Med Virol 13 (3);243-9, 1984, incorporated 
herein by reference in its entirety) . The rcp45 virus was 
reduced approximately 40-fold in replication in the nasal 
turbinates and 1000 -fold in the lungs, and thus was as 
attenuated as the biologically-derived cp45 virus. These 
results indicate that the attenuation phenotype of cp45 was 
successfully reproduced in its recombinant version. Since 
each phenotype of cp45 was fully reproduced in rcp45, the 
additional five mutations in cp45 that were not included in 
this Example likely made little contribution to the properties 
of cp45. 
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Table 11: Level of replication in the upper and lower 
respiratory tract of hamsters* of wt and mutant PIV3b ^. 

Mean virus titer (logio TCIDgo/g ± 
Virus S.E") in: 



Nasal turbinates Lungs 



JS wt rPIV3 


6.9 


± 


0.2 


5.4 


± 


0.5 


rcp45 3'N 


6.5 


± 


0.2 


3.9 


± 


1.1 


rcp45 C 


4.8 


± 


0.3 


3.1 


± 


0.7 


rcp45 M* 


6.8 


± 


0.2 


6.7 


± 


0.3 


rcp45 F 


4.6 


± 


0.2 


3.4 


± 


0.6 


rcp45 HN 


6.3 


± 


0.2 


5.3 


± 


1.0 


rcp45L 


4.2 


± 


0.1 


2.1 


+ 


0.3 


rcp45 3'NL 


4.7 


± 


0.2 


2.1 


± 


0.3 


rcp45 3''NCMFHN 


5.8 


+ 


0.3 


2.9 


± 


0.9 


rcp45 


5.3 


+ 


0.1 


2.4 


± 


0.2 


cp45 


4.9 


± 


0.4 


1.9 


± 


0.2 



a. Groups of five hamsters each were intranasally administered 
■j^qs.o TCID50 of virus per animal in an 0.1ml inoculum, and lungs 
and nasal turbinates were harvested four days later, b. cp45 
is a biologically-derived virus, the other viruses are 
recombinant, c. TCID50 # 50% tissue infectious dose ± Standard 
Error, d. The virus pool used in this study was found to 
contain a mixed plaque phenotype. The attenuation phenotype 
of this mutant will be reassessed using additional virus 
preparations . 

As demonstrated above, the mutations in the L gene 
of cp45 specify the majority of the attenuation phenotype of 
this virus. In the present Example, the contribution of the 
cp45 mutations outside of L as a group was examined. The 
rcp45 3'NCMFHN mutant was only slightly reduced in replication 
in the nasal turbinates, but was more than 100-fold reduced in 
replication in the lungs, which shows that additional 
attenuating mutations were present outside of the L protein. 
Importantly, if each of the three mutations in the L gene of 
rcp45 reverted to wild type sequence, the resulting virus, 
rcp45 3''NCMFHN, would still retain the attenuation phenotype. 
The rcp45 M and rcp45 HN mutant viruses were not defective for 
replication in the respiratory tract of hamsters, and the 
rcp45 3"N virus showed only a marginal decrease in replication 
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in the lower respiratory tract. This suggests that the 
mutations present in the 3" leader, in the N gene start site 
and the N , M and HN proteins are not attenuating in and of 
themselves. However, these mutations could contribute to the 
5 overall attenuation of cp45 in the context of the other cp45 

mutations. Also, individual mutations within the 3'N region 
may have effects which are not apparent when the set of 
mutations is analyzed together^ which can be readily 
determined according to the present disclosure. 

10 Replication of the rcp45 C and rcp45 F mutant 

viruses was approximately 100- fold reduced in both the nasal 
turbinates and the lungs, demonstrating that the mutations 
present in the C and F proteins of cp45 confer the attenuation 
phenotype in hamsters, although the level of attenuation is 

15 not as great as that conferred by the cp45 L mutations. As 

described above, the rcp45 F and rcp4 5 C mutant viruses did 
not possess the ts phenotype, and therefore, the mutations 
that occur in the C and F proteins are considered to be non- ts 
attenuating mutations. 

20 

EXAMPLE XI 

Recovery of Recombinant. Chimeric PIV3 in which the 
Hemaaalutinin and Fusion Glvcoproteins have been Substituted 

with 

25 Corresponding GlvGOprotelns From PIVl 

Within the present example, a chimeric rPIV virus is 
generated and selected which incorporates one or more 
heterologous genes or large polynucleotide sequences from one 
PIV into a different rPIV background. Within this aspect of 

30 the invention, individual genes or gene segments of one PIV 

are substituted by counterpart sequence (s) from a heterologous 
PIV or other source. In one embodiment described in the 
present Example, tools and methods are provided for 
substituting, e.g., the HN and/or F protective antigens of 

35 HPIVl or HPIV2 into a recombinant HPIV3 to yield a chimeric 

recombinant suitable for developing live-attenuated vaccines. 
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Viruses and cells. 

The PIVl strain used in this study, 
PIVl/Washington/20993/1964 (PIVl/Wash64) , was isolated in 
Washington DC in 1964 and was confirmed to be a virulent wild 
5 type virus in clinical studies in adult human volunteers 

(Murphy et al . Infect, Immun. 12:62-8 (1975), incorporated 
herein by reference) . It was propagated in LLC-MK2 cells 
(ATCC CCL 7.1) in Opti-MEM I (Life Technologies) with 50 fig/ml 
gentamicin sulfate, 2 mM glutamine and 0.75 fig/ml trypsin 

10 (Catalog No. 3741, Worthington Biochemical Corp., Freehold, 

NJ) . The Greer strain of human PIV2 (Catalog No. 
V-322-001-020, NIAID Repository, Rockville, MD) used in the 
hemagglutination- inhibition assay (HAI) was propagated in the 
same way. The JS strain of human PIV3 virus and its 

15 recombinant derivative from cDNA (rPIV3/JS) with wild type 

phenotype were propagated as described above. The modified 
vaccinia Ankara (MVA) recombinant that expresses the 
bacteriophage T7 RNA polymerase is described in Wyatt et al., 
Virology 210:202-205 (1995) (incorporated herein by 

20 reference) . 

HEp-2 cells were obtained from ATCC (ATCC CCL 23) 
and maintained in Opti-MEM I (Life Technologies) with 2% fetal 
bovine serum (FBS) , 50 /xg/ml gentamicin sulfate and 2 mM 
glutamine . 

25 Construction of a cDNA encoding a complete chimeric 

PIV3-PIV1 antigenome. 

A cDNA encoding a full-length PIV3 antigenomic RNA 
in which the PIV3 HN and F ORFs were replaced by their PIVl 
counterparts was constructed as shown in Figure 14 . cDNA 

30 clones of the HN and F genes of PIVl/Wash64 were prepared from 

RNA extracted from LLC-MK2 cells which had been infected with 
PIVl/Wash64 wild type virus. cDNA was generated using the 
Superscript Preamplif ication System using random hexaraer 
primers (Life Technologies) . PIVl F and HN cDNAs were 

35 amplified with Vent DNA polymerase (New England Biolabs, 

Beverly, MA) using gene specific primer pairs based on 
consensus sequences present in GenBank. All primers described 
below are annotated so that PIV specific sequences are 
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underlined, restriction sites are in italics, nt altered from 
wild type sequences are in lowercase, and start and stop 
codons are in bold. The positive sense PIVl F primer, 
hybridizing to nt 69-97 upstream of the start codon, was 
5 5 ' - GGGAAAGAAtCCAGAGACAAGAACGG - 3 * (SEQ ID NO: 33). The 

negative- sense PIVl F primer, hybridizing to nt 36-61 
downstream of the F stop codon, was 

5 ' - GGTGAAGTTGTGGAT c cATTTGATTG - 3 ' (SEQ ID NO: 34). It carries 
a BamH I site. The positive-sense primer for PIVl HN was 5'- 
10 CAACCTGTAAGG tA cCAGCATCCG -3^ (SEP ID NO: 35). It hybridizes to 

nt 13-36 upstream the HN start codon and carries a Kpn I site. 
The negative-sense PIVl HN primer was 

5 ' - GATATGGTGTT a GGcCrrGATCTGTTC -3 ' (SEQ ID NO: 36). It 
hybridizes to the last two nt of the stop codon and 25 nt 

15 further downstream and carries a Stu I site. The PIVl F cDNA 

was cloned as a Ba/nH I and blunt-end fragment into BainH I-EcoR 
V digested pLITMUS28 (New England Biolabs) , while the PIVl HN 
cDNA was cloned as a iCpn I -Stu I fragment into the same 
vector. The nt sequences of the resulting plasmids, 

20 designated as pLit.lHNwt and pLit.lFwt (GenBanK Accession 

number: AF016280, AF016279) , were determined using the 
Circumvent Sequencing Kit (New England Biolabs) . These two 
clones were modified (Fig. 14) using mutagenic PGR primers to 
delete their non-coding regions and to introduce new 

25 restriction sites flanking their start and stop codons for the 

purpose of constructing the PIV3-1 chimeric HN and F genes. 
The sequences of positive-sense and negative -sense PIVl F 
mutagenic primers were 5' -CqcGATG aAAAAATCAGAGATCCTCTTCT -3 ^ 
(SEQ ID NO: 37) and 5 ' - Ct qrqa tec t AATTGGAGTTGTTACCCATGTA - 3 ^ (SEQ 

30 ID NO: 38), respectively, the introduced restriction sites are 

Nco I and BamH I. The sequences of positive- sense and 
negative sense PIVl HN mutagenic primers were 5'- 
AACcATG GCTGAAAAAGGGAAAA -S ^ (SEQ ID NO: 39) and 5'- 
GGTGAaGCTt AAGATGTGATTTTACATATTTTA -3 ' (SEQ ID NO: 40), 

35 respectively, the introduced restriction sites are Nco I and 

Hind III. 

The second step involved the modification of the 
PIV3 HN and F genes to serve as acceptors of the PIVl coding 
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regions generated above. The PIV3 F and HN genes were 
subcloned in several steps from the full-length clone of 
PIV3/JS, p3/7(131)2G to generate pLit . PIV3 .HN. 4 and 
pLit .PIV3 .F.3a (Fig. 14). The PIV3 F or HN coding regions 
5 were deleted by PGR mutagenesis and replaced by restriction 

sites appropriate to accept the PIVl HN and F cDNA described 
above. The sequences of the PIV3 F positive sense and 
negative sense mutagenic primers were 

5 ' 'MATAaaa tc cCTACAGATCATTAGATATTAAAAT - 3 ' (SEQ ID NO: 41) and 

10 5 ' -CacCAr qGTGTTCAQTQCTTGTTG - 3 ' (SEQ ID NO: 42), respectively, 

the introduced restriction sites were BamH I and Nco I. The 
sequences of PIV3 HN positive sense and negative sense 
mutagenic primers were 5 ' - CCACAAqC tTAATTAACCATAATATGCATCA - 3 ' 
(SEQ ID NO: 43) and 5/ -TTCCATcy aATTTGGATTTGTCTATTGGGT -3 ^ (SEQ 

15 ID NO: 44), respectively, the introduced restriction sites 

were Hind III and Nco I. The stop codon for the chimeric HN 
would be regenerated upon ligation with mutagenized PIVl HN 
cassette. The PIVl HN or F cDNAs described above were 
imported in as a Nco I -Hind III fragment for HN or as a Nco 

20 l-Bairm I fragment for F, which generated pLit.PIV3-l.HNhc and 

pLit.PIV3-l.Fhc. These two cDNAs were then joined into 
pSE. PIV3-1 .he, which was subsequently sequenced in its 
entirety. Then the BspE I-Sph I fragment was inserted into 
p3/7{131)2G to generate pFLC.2G+.hc (Fig. 14). The nucleotide 

25 sequence of this BspE 1-Sph I fragment, containing the 

chimeric F and HN genes, is in GenBank (Accession No. 
AF016281) . Importantly, the cDNA engineering was designed so 
that the final rPIV3-l antigenome conformed to the rule of six 
(Durbin et al. Virology 234:74-78 (1997), incorporated herein 

30 by reference) . The plasmid pFLC.2G+.hc encoding the rPIV3-l 

chimera was deposited on 9/17/97 under the terms of the 
Budapest Treaty with the American Type Culture Collection 
(ATCC) of 12301 Parklawn Drive, Rockville, Maryland 20852, 
U.S.A. 

35 Transf ection. HEp-2 cell monolayers were grown to 

confluence in six-well plates (Costar Corp, Cambridge, MA) and 
transf ections were performed as described above. Cell 
monolayers were infected with MVA-T7 at a multiplicity of 
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infection (MOD of 3 in 0.8 ml of serum-free Opti-MEM I 
containing 50 /xg/ml gentamicin, and 2 mM glutamine and then 
transf acted with the three support plasmids, 0.4 /xg pTM(N), 
0.4 ^ig pTM(P), 0.05 /xg pTM(L) , and 5 /xg the PIV3-1 antigenome 
5 cDNA. The plasmids were mixed in 0.2 ml of Opti-MEM I 

containing 12 ptl LipofectACE (Life Technologies) and added to 
each well. The PIV3/JS cDNA plasmid, p3/7(131)2G, which 
encodes wild type PIV3/JS antigenomic RNA, was transfected in 
parallel as a positive control. After incubation at 32'*C for 

10 12 hours, the transf ection medium was replaced with 1.5 ml of 

fresh Opti-MEM I supplemented with 50 /xg/ml gentamicin, and 
cultures were incubated at 32**C for two additional days. 
Trypsin was added to a final concentration of 0.75 ;xg/ml on 
day 3 post transf ection . Cell culture supernatants were 

15 harvested on day 4 and passaged (referred to as passage 1) 

onto fresh LLC-MK2 cell monolayers. After overnight 
adsorption, the medium was replaced with fresh Opti-MEM I with 
0.75 fig /ml trypsin. Passage 1 cultures were incubated at 32*^C 
for 4 days, and the amplified virus was harvested and further 

20 passaged on LLC-MK2 cells (referred to as passage 2) for 

another 4 days at 32^C in the presence of 0.75 iig/ml trypsin. 
The recovered vims was characterized by reaction with rabbit 
PIVl antiserum, guinea pig PIV2 antiserum, and two PIV3 HN 
monoclonal antibodies in a hemagglutination-inhibition (HAI) 

25 assay as described previously (van Wyke Coelingh et al., 

supra, (1985) ) with the exception that chicken red blood cells 
(RBCS) were used for viruses possessing the PIVl HN 
glycoproteins, whereas guinea pig RBCs were used for PIV3 . 
The HAI was performed to determine if rPIV3 possessed the 

30 monoclonal antibody resistant mutation (MARM) that marked 

virus recovered from cDNA or if rPIV3-l possesses the PIVl HN. 

Nucleotide sequence analysis. The chimeric cDNA 
construct pSE.PIV3-l .he was sequenced with the Circumvent 
Sequencing kit (New England Biolabs) before the final assembly 

35 into full-length clone pFLC.2G+.hc. For analysis of RNA, the 

appropriate PIVs were amplified in T75 flasks of LLC-MK2 
cells. Virus was harvested on day 5 post-infection and 
concentrated by polyethylene glycol precipitation (Mbiguino et 
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al., J. Virol. Methods 31:161-70 (1991)). Viral RNA was 
extracted from the viral pellets and used in reverse 
transcription (RT) with the Superscript Preamplif ication 
System (Life Technologies) , RT-PCR was performed using the 
5 Advantage cDNA synthesis kit (Clontech) and PIVl or PIV3 

specific primer pairs. RT-PCR products were gel purified by 
electrophoresis onto, and elusion from, strips of NA45 DEAE 
nitrocellulose membrane (Schleicher & Schnuell, Keene, NH) , 
and were sequenced. 

10 Replication of PIVs in LLC-MK2 cells. Plaque 

enumeration on LLC-MK2 monolayers was performed as previously 
described except that trypsin was added in the case of the 
PIVl and rPIV3-l viruses (Hall et al . Virus Res. 22:173-84 
(1992)). Briefly, serially-diluted virus was inoculated onto 

15 LLC-MK2 monolayers in six-well plates; virus was adsorbed for 

1 hour at 32®C; the cultures were overlaid with L-15 medium 
(Life Technologies) containing 0.8% agarose with or without 
added trypsin and incubated at 32^C for 6 days; and plaques 
were identified by hemadsorption (HAD) with guinea pig red 

20 blood cells following the removal of the agarose overlay. 

Growth of the PIV viruses in tissue culture was 
evaluated by infecting confluent LLC-MK2 monolayers on 
twelve-well plates with virus at a MOI of 0.01. After 
adsorption at 32°C for 1 hour, the inoculum was replaced with 

25 1.5 ml/well Opti-MEM I supplemented with gentamicin (50 /ig/ml) 

and trypsin (0.75 fig /ml) , and further incubated at 32*^0 for 6 
days. At 24 hour intervals, 0.3 ml medium was removed from 
each well and 0.3 ml fresh medium with trypsin was added back. 
The titer of virus was determined at 32^ C by hemadsorption 

30 assay on LLC-MK2 cell monolayers using fluid overlay as 

previously described (Hall et al. Virus Res. 22:173-84 
(1992)), and the titers were expressed as logioTCIDso/ml . 

Replication of PIVs in the respiratory tract of 
hamsters. Golden Syrian hamsters in groups of 12 were each 

35 inoculated intranasally with 0.1 ml of L-15 medium containing 

10^ pfu of rPIV3/JS, rPIV3-l, or PIVl/Wash64 . On days 4 and 5 
post -inoculation, six hamsters from each group were 
sacrificed, and their lungs and nasal turbinates harvested. 
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and homogenized, and virus present in the samples was titered 
on LLC-MK2 cell monolayers at 32*»C. The titers were expressed 
as mean logio TCIDgo/g for each group of six hamsters. 

5 RESULTS 

Construction of a cDNA clone encoding a full-length, 
chimeric PIV3-1 antigenomic RNA yielded recovery of the 
Chimeric Virus rPIV3-l. 

As noted above, the final construct of the PIV3 and 

10 PIVl chimeric cDNA, in which the ORFs of the JS wild type PIV3 

HN and F glycoprotein genes were replaced by those of 
PIVl/Wash64 coding sequences (Fig. 14) encodes a PIV3-1 
chimeric antigenomic RNA of 15,516 nt, which conforms to the 
rule of six (Durbin et al., Viroloav 234:74-78 (1997)). The 

15 pFLC.2G+.hc cDNA encoding the chimeric PIV3-1 antigenome was 

transfected onto HEp-2 cells together with the N, P and L 
support plasmids. The p3/7(131)2G cDNA encoding the JS wt 
PIV3 antigenome was transfected in parallel to generate a 
rPIV3 control parental virus. Virus was recovered from each 

20 transfection following the second amplification on LLC-MK2 

cells, and studies were initiated to confirm that each 
recombinant virus was derived from cDNA. 

Recombinant viruses rPIV3-l and rPIV3 were first 
characterized for the presence of the PIVl or PIV3 HN 

25 glycoprotein by HAI assay with serotype-specif ic anti-HN 

monoclonal or polyclonal antibodies. As shown in Table 12, 
rPIV3 reacted with only one of the two PIV3 mAbs as expected, 
whereas its biologically derived parent PIV3/JS reacted with 
both. This confirmed that rPIV3 contained the introduced MARM 

30 mutation that marks this virus as being derived from cDNA. 

The rPIV3-l virus reacted with antibodies specific to the PIVl 
HN glycoprotein, but not to ones specific to HN of PIV3 or 
PIV2, showing that the virus contained the PIVl HN gene as 
expected. 
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Table 12. rPIV3-l possesses the HN glycoprotein of PIVl 



Hemagglutination- inhibition titer^ (reciprocal) of indicated 
monoclonal antibody or polyclonal antiserum 

Virus 





PIVl^ 
antiserum 


PIV2^ 
antiserum 


Of-PIV3 mAb 
423/6^ 


a-PlV3 mAb 
77/5^ 


PIVl/Wash€4 


256 




s50 


550 


rPIV3-l 


64 


s2 


550 


550 


rPlV3/JS 


4 


s2 


s50 


3,200 


PIV3/JS 


8 


s2 


12,800 


6,400 


PIV2 /Greer 


8 


512 


550 


550 



®Chick red blood cells (RBC) were used in HAI assay for PIVl, PIV2, and 
rPIV3-l and guinea pig RBCs were used for PIV3/JS and rPIV3/JS. 



^PIVl rabbit antiserum was purchased from Denka Seiken Co. Ltd., Japan 
(Catalog No. 410-701) , and PIV2 guinea pig antiserum was obtained 
f rom NIAID repository, Rockville, MD (Catalog No. V-322-50-558) . 

^Biologically derived PIV3/JS contains epitopes recognized by both mAb 
423/6 and 77/5, whereas rPIV3/JS was engineered to lack reactivity 
with mAb 423/6. 

*^The PIV2 antiserum had some reactivity with PIVl virus, and therefore is 
not completely type specific. 



It was next confirmed that the rPIV3-l virus 
contained the engineered, chimeric PIV3-1 HN and F genes. As 
designed, the genetic structure of rPIV3-l was unique in four 
junction regions when compared with either of its parents, 
PIVl/Wash64 or rPIV3/JS (boxed in Figure 15A) . These regions 
are the transition points at which the sequence switches from 
the PIV3 non- coding region to the PIVl coding region and then 
from the PIVl coding region back to the PIV3 non- coding 
region. Using the primer pair A specific to PIV3 M and L 
genes, or primer pair B specific to the PIVl M and the very 
3 '-end of HN gene, RT-PCR products were generated for 
virion-derived RNAs from rPIV3-l, rPIV3/JS, and PIVl/Wash64 . 
Control reactions showed that the RT step was required for 
generation of RT-PCR products, indicating that an RNA 
template, rather than contaminating DNA, was required to 
produce the RT-PCR product. An early indication that rPIV3-l 
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was indeed a chimeric vims came from the finding that only 
the PIV3-specif ic primer pair A generated the expected 4.6 kb 
cDNA product that spans the F and HN genes (Fig. 15B) . Thus, 
while rPIV3-l virus contains only HPIVl-specif ic HN 
5 glycoprotein (See Table 12) , the non~coding regions are 

specific to PIV3 . Conversely, the PIVl specific primer pair B 
amplified an appropriately sized product from PIVl control but 
not from rPIV3-l. Restriction digestion analysis also 
demonstrated that rPIV3-l RT-PCR product had unique 

10 restriction patterns different from that of rPIV3/JS and 

PIVl/Wash64 and appropriate for its predicted sequence. 

The nt sequence of the 4,6 kb RT-PCR product of 
rPIV3-l was determined in its four regions (Fig. 15 A) and 
compared with that of rPIV3/JS and PIVl/Wash64 (Fig. 16) . The 

15 rPIV3-l sequence was completely in agreement with the cDNA 

from which it was derived. Examination of the sequence 
alignment of the Region I -IV for the three RT-PCR products 
illustrates that rPIV3-l contains the PIVl F and HN 
glycoprotein ORFs with altered start and stop codons and 

20 flanked by the 5' and 3' non-coding regions of PIV3 . Examples 

of sequencing ladders spanning the Region III and IV of 
rPIV3-l (Fig. 17), compared in parallel with rPIV3/JS or 
PIVl/Wash64, were evaluated, and this analysis confirmed that 
rPIV3-l is a recombinant chimeric virus whose structure is 

25 completely in agreement with the cDNA from which it was 

generated . 

Trypsin-dependence and cytopathicity of rPIV3-l in 

vi tro . 

PIVl, like Sendai virus but contrary to PIV3, 
30 requires trypsin for cleavage of its F glycoprotein in order 

to undergo multicycle replication on continuous lines of 
tissue culture cells (Frank et al. J. Clin. Microbiol. 10:32-6 
(1979)). In addition, PIVl is a non-cytopathic virus whereas 
PIV3 readily produces extensive CPE (Collins et al. In Fields 
35 Virology . 3rded,, 1:1205-43 (1996)). rPIV3-l, rPIV3 and 

PIVl/Wash64 were compared on the basis of these properties. 
rPIV3-l, like PIVl/Wash64, had a higher HA titer using 
chicken, rather than guinea pig (Table 13), RBCs. rPIV3-l, 
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like its PIVl/Wash64 parent, required trypsin for efficient 
replication in cultures with fluid overlay as well as for 
efficient plaque formation. rPIV3-l produced plaques at 32°C, 
37°C or 40®C with similar efficiency. It is therefore evident 
that rPIV3-l possesses the F glycoprotein of the PIVl parent 
virus, and it is not temperature sensitive. On the other 
hand, rPIV3-l produced CPE, as indicated by the cell rounding 
and detaching in the virus infected monolayers, almost to the 
same extent as its PIV3 parent suggesting that this biological 
property is a function of its PIV3 genes, which lie outside of 
the HN and F coding regions. Thus, rPIV3-l possesses 
biological properties from both parents which is consistent 
with the findings above demonstrating that it is a chimeric 
virus. This exemplary recombinant, chimeric virus within the 
invention was deposited on May 21, 1998 under the terms of the 
Budapest Treaty with the American Type Culture Collection 
(ATCC) of 12301 Parklawn Drive, Rockville, Maryland 20852, 
U.S.A. 
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Table 13. Comparison of the HA titer and the inf activity and 
cytopathicity of parental and chimeric PIVs^ 

HA titer Infectious titer** 

using indicated (LogjoTCIDgo/ml) 

RBC using CPE or HAD as 

endpoint 

Virus 



CPE HAD 

Chicken Guinea 

Pig Trypsin Trypsin Trypsin 



+ - + - + 



PIVl/Wash64 


16 


8 


s2.5 


s2.5 


4.8^ 


6. 


.3 


sO.7^ 


5. 


8 


rPIV3-l 


64 


16 


s2.5 


5.8 


5.5^ 


7. 


.8 


sO.7^ 


7, 


,1 


rPIV3/JS 


0 


8 


4.5 


7.3 


5.0^ 


7. 


.5 


5.0 


6 


.2 



*Virus stocks were grown in LLC-MK2 cells which were infected at an MOI of 
0.01 and incubated for 6 days in the presence (PIVl/Wash64 , rPIV3-l) 
or absence (rPIV3/JS) of 0.75 /ig/ml trypsin. The resulting virus 
stocks were assayed by the tests below in the presence or absence of 
trypsin as indicated. 

*^he TCID50 assay was read at 6 days by direct visualization of CPE or by 
hemadsorption (HAD) . 

^Plaques were visualized by HAD after six days of incubation. 

**The HAD of PIV3- infected monolayers was grossly apparent whereas that of 
PIVl and rPlV3-l was observcible only under the microscope in which 
single cells with RBC adsorbed were observed. 

®The lowest level of virus detectable was 10° *'/ml. 

Comparison of the level of replication of rPIV3-l and its 
parental viruses in LLC-MK2 cells and hamsters. 

The multicycle replication of rPIV3, rPIV3-l, and 
PIVl Wash/64 viruses was evaluated following inoculation of 
LLC-MK2 tissue culture cells at a MOI of 0.01 (Fig. 18). It 
can be seen that the kinetics and magnitude of replication of 
the three viruses are very similar. This indicates that the 
substitution of the HN and F genes of PIVl for those of PIV3 
did not attenuate the virus for replication in vitro. It was 
next determined whether rPIV3-l was attenuated in vivo, 
specifically for replication in the upper and lower 
respiratory tracts of hamsters (TABLE 14) . The observed level 
of replication of rPIV3-l was similar to, if not slightly 
higher than, either parent in the upper and lower respiratory 
tract of hamsters. 



PFU/ml*= 
(Logio) 
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Table 14. Level o£ replication o£ parental and chimeric PIVs in the upper 
and lower respiratory tract of hamsters^ 



Virus titer (mean log^o TCIDjo/gramiS .E. ) 
in indicated tissue 



Virus 



Nasal Turbinates Lungs 

Day 4 Day 5 Day 4 Day 5 

PIVl/Wash64 5.2±.24 5.2±.12 5.0±.31 5.0±.38 

rPIV3-l 4.9±.23 6.2±.17 5.8±.15 6.0±.09 

rPIV3/JS 4.5±.09 5.0±.18 5.1±.26 5.0±.32 

^Hamsters were infected intranasally with 10^*° TCID50 per animal of the 

indicated virus, and lungs and nasal turbinates were removed on day 4 
or 5 after infection. The titers are mecuis of six animals per day 
and are expressed as mecui logjoTCIDso/gram ± standard error. 

In summary, the present Example demonstrates 
successful recovery of a rPIV3-l chimeric virus in which the 
ORFs of the PIVl HN and F glycoproteins were substituted for 
those of rPIVB. This chimeric virus replicated like its wild 
type PIVl and PIV3 parental viruses in vitro and in vivo, 
demonstrating that the substitution of the glycoprotein ORFs 
did not result in attenuation of rPIV3-l. This successful 
recovery of a recombinant PIVS which bears the HN and F 
glycoproteins of PIVl is surprising because the two viruses, 
representing distinct serotypes, are not closely related. In 
particular, it is remarkable that the chimeric recombinant 
grows as well as the two parents. Notably, chimeric 
recombinant viruses possessing a substitution in the 
glycoprotein gene have also been recovered for vesicular 
stomatitis virus (VSV) (Lawson et al . Proc. Natl. Acad. Sci. 
USA 92:4477-81 (1995)). Specifically, the VSV G glycoprotein 
gene of Indiana serotype was replaced by that from New Jersey 
serotype which share only 50% amino acid sequence identity. 
In contrast to rPIV3-l, the chimeric recombinant VSVi/^j 
replicates to only 10% the level of recombinant VSVj or 
biologically derived, VSV. 

In the present Example, the HN and F glycoproteins 
have 43 and 47% sequence identity, respectively, between PIVl 
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and PIV3. The transfer of the two glycoproteins together 
would, of course, obviate glycoprotein-to-glycoprotein 
incompatibility (Tanabayashi , K. and Compans, R.W. J. Virol. 
70:6112-18 (1996)). On the other hand, it is generally 
5 thought that the glycoproteins interact with the M protein 

(which is 63% identical between PIVl and PIV3) through their 
cytoplasmic (CT) or transmembrane (TM) domains, and that this 
interaction is important in virion morphogenesis and 
structure. In this regard, the degree of sequence identity 

10 between the HN and F proteins of the two serotypes in the TM 

and CT domains is low indeed: 30% and 22%, respectively for 
the TM domain, and 9 and 11% respectively for the CT domain. 
In light of this low level of sequence relatedness, we have 
also pursued a parallel strategy of constructing chimeric 

15 glycoproteins in which the PIVl ectodomain of each 

glycoprotein was fused to the PIV3 TM and CT domains . 
Regarding possible interaction with the M protein or other 
internal proteins, it might be that a conserved structure, 
such as a constellation of charged amino acids, is important 

20 rather than a conserved sequence. Alternatively, it might be 

that interaction of the TM and CT domains of the glycoproteins 
with internal proteins is not as critical as has been 
previously thought. It will be possible to examine these 
factors more closely using the methods and tools provided 

25 herein. For example, these factors will be further elucidated 

by work in progress employing the methods described above to 
construct a PIV3 virus bearing HN and F of PIV2. 

It was expected that rPIV3-l would require trypsin 
for efficient replication in tissue culture since this is a 

30 property conferred by the PIVl F glycoprotein, and this was 

found to be the case. However, it was interesting to observe 
that rPIV3-l caused CPE that more closely resembled that of 
PIV3 parent virus, indicating that a PIV3 gene{s) other than 
HN or F specifies this phenotype. These roles will also be 

35 further elucidated using the methods and tools provided herein 

to exchange additional gene(s) between the non-cytopathic PIVl 
and the cytopathic PIV3 . 
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EXAMPLE XII 

Recovery of Live-Attenuated Chimeric Recombinant PIV Encoding 
the Internal Proteins of PIV Type 3 and the Surface 
Glycoproteins of PIV Type 1 

5 In the present Example, a derivative of rPIV3-l 

carrying the three temperature- sensitive and attenuating amino 
acid coding changes found in the L gene of the live-attenuated 
cp45 PIV3 candidate vaccine virus, termed rPIV3-l . cp45L, is 
shown to exhibit a temperature sensitive phenotype with a 

10 shut -off temperature of 38<>C, similar to that of the 

recombinant rPIV3cp45L which possesses the same three 
mutations. rPIV3-l . cp45L is attenuated in the respiratory 
tract of hamsters to the same extent as rPIV3cp45L. Infection 
of hamsters with rPIV3-l . cp45L generates a moderate level of 

15 hemagglutination- inhibiting antibodies against wt PIVl and 

induces complete resistance to challenge with wild type PIVl. 
This demonstrates that attenuated chimeric PIV according to 
the invention are capable of inducing a highly effective 
immune response against PIVl. This disclosure also confirms 

20 the above described data demonstrating that the surface 

glycoproteins of parainfluenza viruses are sufficient to 
induce a high level of resistance to homologous virus 
challenge. Unexpectedly, infection with recombinant chimeric 
virus rPIV3-l . cp45L or rPIV3-l, each bearing the surface 

25 glycoprotein genes of PIVl and the internal genes of PIV3, 

also induces a moderate level of resistance to replication of 
PIV3 challenge virus. This indicates that the internal genes 
of PIV3 can independently induce protective immunity against 
PIV3 in rodents. Thus, a reverse genetics system for PIV3 as 

30 disclosed herein successfully produces live attenuated PIVl 

vaccine candidates that are attenuated and protective in 
accepted model subjects. 

Viruses and cells. 

The wt PIVl strain used in this study is 
35 PIVl/Washington/20993/1964 (PIVl/Wash64) (see, eg.. Murphy et 

al., Infect. Immun. 12 :62. 1975 (incorporated herein by 
reference in its entirety) . Chimeric rPIV3-l, recovered from 
chimeric PIV3 cDNA in which the PIV3 F and HN ORFs were 
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replaced with those of PIVl/Wash64, as described above and in 
Tao et al., J. Virol. 72 :2955, 1998 (incorporated herein by 
reference in its entirety) • These viruses were propagated in 
LLC-MK2 cells (ATCC CCL 7.1) in Opti-I4EM I (Life Technologies, 
Gaithersburg, MD) with 50 iig/ml gentatnicin sulfate, and 0.75 
fig/ml trypsin (Catalog No. 3741, Worthington Biochemical 
Corp., Freehold, NJ) . Trypsin is included because the F 
glycoprotein of PIVl, but not that of PIV3, is dependent on 
exogenous trypsin for cleavage when grown in cell culture 
under these conditions. The wt JS strain of human PIV3 virus 
and its recombinant derivative from cDNA (rPIV3/JS) were 
propagated as described above and in Durbin et al.. Virology 
235:323, 1997 (incorporated herein by reference in its 
entirety). The propagation of cp45, an attenuated derivative 
of wt PIV3/JS (see above; and Karron et al., J. Infect. Bis. 
171:1107, 1995 (incorporated herein by reference in its 
entirety)), and rPIV3cp45L, a recombinant PIV3 carrying the 
three ts mutations found in the L gene of cp45, were 
propagated as described above and in Skiadopoulos et al . , J 
Virol 72 :1762, 19 9 8 (incorporated herein by reference in its 
entirety) . The modified vaccinia Ankara (MVA) recombinant 
that expresses the bacteriophage T7 RNA polymerase is 
described in Virology 210 : 202, 1995 (incorporated herein by 
reference in its entirety) . 

HEp-2 cells, which are used in transf ection, were 
obtained from ATCC (ATCC CCL 23) and maintained in Opti-MEM I 
with 2% fetal bovine serum (FBS) , 50 ^g/ml gentamicin sulfate. 

Introduction of L mutations into rPIV3-l antigenomic 

cDNA. 

The three L mutations of cp45 present in the 
pTM(L) 942/992/1558 plasmid, described above (see also, 
Skiadopoulos et al., J Virol 72 :1762, 1998, were introduced 
into chimeric cDNA pFLC.2G+.hc (described above; see also, Tao 
et al., J Virol 72 :2955, 1998), as a 2.8 kb Sphl-Nhel 
fragment (nt 11313 to 14052 in PIV3 antigenomic cDNA) to 
generate the full-length pFLC.2G+.hc.cp45L bearing the PIVl F 
and HN ORFs and the three cp45 L gene mutations (Fig. 19) . 
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The specific mutations present in pTM(L) 942/992/1558 are 
indicated in the legend to Figure 19. 

Transf ection. HEp-2 cell monolayers in six-well 
plates were grown to confluence and transf ections were 
performed as described above (see also, Tao et al . , J Virol 
72:2 955, 1998) . Trypsin was added to a final concentration of 
0.75 iig/ml on day 3 post transf ection prior to harvesting on 
day 4 . Cell culture supernatants were clarified and passaged 
(referred to as passage 1) onto fresh LLC-MK2 cell monolayers. 
After overnight adsorption, the medium was replaced with fresh 
Opti-MEM I with 0.75 fig/ml trypsin. Passage 1 cultures were 
incubated at 32^C for 4 days, and the virus present in the 
supernatant was harvested and passaged again under the same 
conditions (referred to as passage 2) . Virus present in the 
passage 2 harvest was tested for the presence of the PIVl HN 
protein by hemagglutination- inhibition (HAI) assay as 
described above (see also Tao et al., J Virol 72 ; 2955, 1998). 

Replication of PIVs in LLC-MK2 at various 
temperatures . 

Plaque enumeration on LLC-MK2 monolayers was 
performed as described above, with 0.75 /xg/ml trypsin added to 
the agarose overlay in the case of PIVl, rPIV3-l, and 
rPIV3-l.cp45L (see also, Tao et al., J Virol 72 :2955. 1998). 
After incubation at various temperatures for 6 days, the 
agarose overlay was removed and plaques were identified by 
hemadsorption (HAD) with guinea pig erythrocytes (RBCs) . 

Replication of PIVs in the respiratory tract of 

hamsters* 

Groups of five hamsters were inoculated intranasally 
with 0.1 ml of LIS medium containing 10^ plaque forming units 
(PFU) of rPIV3/JS, rPIV3cp45L, cp45, PIVl/Wash64, rPIV3-l, or 
rPIV3-l . cp45L. Hamsters were sacrificed on day 4 
post -infect ion, and their lungs and nasal turbinates were 
harvested and homogenized. Virus present in the tissue 
samples was titered on LLC-MK2 cell monolayers at 32 as 
described above and in Tao et al., J Virol 72 :2955. 1998. 
The titers are expressed as reciprocal mean logioTCIDgo/gram of 
tissue for each group. 
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Immunization and challenge studies in hamsters. 

Groups of ten hamsters were immunized intranasally 
with 10^ PFU of virus per animal, as described above. Serum 
was collected for HAI assay prior to infection and on day 33. 
The level of HAI antibodies present in the sera of each group 
of 10 hamsters was determined using PIVl/Wash64 and PIV3/JS as 
antigens, and the HAI titers determined are presented as mean 
logj (see also, Tao et al., J Virol 72 :2955. 1998). 

Thirty-five days post-immunization, five hamsters 
from each group were challenged intranasally with 10^ PFU of 
either PIVl/Wash64 or rPIV3/JS. Nasal turbinates and lungs of 
these challenged hamsters were harvested four days post 
challenge. Virus titers in tissue samples were determined on 
LLC-MK2 monolayers as described above and in Tao et al., J 
Virol 72 :2955. 1998, and the titers are presented as mean 
logioTCIDso/gram of tissue. 

RESULTS 

Recovery and characterization of the recombinant 
chimeric virus rPIV3-l .cp45L. 

As noted above, the cDNA clone pFLC.2G+.hc, a 
full-length antigenomic cDNA of PIV3 in which the ORFs 
encoding the F and HN glycoproteins have been replaced by 
those of PIVl, was modified by introduction of three amino 
acid coding changes (designated 942, 992 and 1558, according 
to amino acid position in the L protein) identified in the L 
gene of cp45 and shown to be independent ts and attenuating 
mutations (Fig. 19; see also, Skiadopoulos et al., J Virol 
72:1762, 1998). Each coding change was marked by the 
CO- introduction of contiguous translationally silent nt 
substitutions that ablate a naturally -occurring restriction 
site (Fig. 19; Table 8) . The final full-length plasmid 
construct, pFLC.2G+,hc.cp45L (Fig. 19), encodes a PIV3-1 
chimeric antigenomic RNA of 15516 nt in length and conforms to 
the rule of six (see, Durbin et al., Virology 234 :74. 1997, 
incorporated herein by reference in its entirety) . The 
authenticity of pFLC . 2G+ . he . cp45L was confirmed by digestion 
with appropriate restriction enzymes. 
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The pFLC.2G+.hc.cp45L cDNA was transfected into HEp-2 
cells together with the PIV3 N, P and L support plasmids and 
infected with MVA-T7 as described above and in Tao et al . , J__ 
Virol 72 :2955. 1998) . Virus recovered after two passages on 
5 LLC-MK2 cells, termed rPIV3-l . cp45L, was biologically cloned 

by plaque-to-plaque-to-plaque passage, and amplified virus was 
analyzed to confirm that it possessed the PIVl glycoproteins 
and the three introduced mutations in L. First, the presence 
of the PIVl HN protein in rPIV3-l . cp45L was confirmed by 

10 reactivity with PIVl specific antibodies in HAI assay as 

described above and in Tao et al., J Virol 72 ; 2955. 1998) . 
The presence of the chimeric PIV3-1 HN and F genes as well as 
the introduced L gene mutations in rPIV3-l . cp45L genomic RNA 
was confirmed by restriction enzyme digestion or nucleotide 

15 sequence analysis of RT-PCR products generated from virion RNA 

as described above and in Tao et al . , J Virol 72 ; 2955, 1998. 
These data confirmed that rPIV3-l . cp45L is a recombinant 
chimeric virus bearing the three codon substitutions of the L 
gene of cp45. 

20 rPIV3-l.cp45L is temperature sensitive. 

The three L gene mutations of cp45 were shown above 
to confer the ts phenotype when introduced into wt PIV3 (see 
also, Skiadopoulos et al., J Virol 72 ; 1762, 1998). To 
evaluate whether their presence in the chimeric virus would 

25 have the same effect, the efficiency of plaque formation of 

rPIV3-l. cp45L was determined at various temperatures. As 
shown in Table 15, the three L mutations indeed conferred the 
ts phenotype to the chimeric virus. The level of temperature 
sensitivity specified by the cp45 L mutations in the 

30 recombinant viruses rPIV3cp45L and rPIV3-l . cp45L was 

equivalent (Table 15) , indicating that the effect of the 
mutations is independent of the PIV3 or PIVl HN and F 
glycoproteins. The level of temperature sensitivity of 
rPIV3cp45L and rPIV3-l . cp45L was comparable to that of the 

35 biologically derived cp45 virus, despite the fact that the 

latter virus possesses mutations outside of L (see. Stokes et 
al.. Virus Res 30 ; 43, 1993, incorporated herein by reference 
in its entirety. 
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Table 15. The recombinant chimeric rPIV3-l.cp45L candidate 
vaccine virus is temperature sensitive 

5 



virus * Virus titer ^ at indicated temperatures 

(logjoPFU/ml) 



10 





32«C 


36^0 


37«C 


38*C 


39^0 


40*C 


rPlV3/JS 


7.4 


7.2 


7.2 


7.2 


7.2 


7.2 


PZVl/Wash64 ^ 


7.5 


7.6 


7.5 


7.5 


7.4 


7.2 


rPIV3-l 


7.5 


7.5 


7.5 


7.2 


6.0 


6.1 


PIV3cp45 ^ 


7.4 


6.9 


6.8 


4.7 ^ 


<0.7 


<0.7 


rPIV3cp45L 


7.9 


7.4 


7.7 


5.3 


1.2 


<0.7 


rPIV3-l.cp45L 


8.1 


8.0 


8.2 


6.1 


<0.7 


<0.7 



15 * Virus nomenclature: rPIV3/JS, recombinant wt PIV3 strain JS; 

PIVl/Wash64, biologically- derived wt PIVl; rPIV3-l, 
recombinant chimeric PIV3 in which the F and HN ORFs have been 
replaced with those of PIVl/Wash64; PIV3cp45, biologically- 
derived cp45 candidate vaccine virus; rPIV3cp45L, recombinant 

20 PIV3 containing the three L gene mutations of cp45; rPIV3- 

l.cp45L, recombinant chimeric rPIV3-l containing the three L 
gene mutations of cp45. 

Virus titers were determined using LLC-MK2 monolayers in 12- 
25 well plates. Titers are the average of two assays. 

Biologically-derived viruses. All others are recombinant 
viruses . 

30 The shut-off temperature, i.e. the lowest restrictive 

temperature at which a two log^o reduction in virus titer is 
observed, of each ts virus is indicated in bold. 

Level of replication o£ rPIV3-l .cp45L in hamsters. 

35 The three L gene mutations of cp45 were shown above 

to confer attenuation of virus replication in the upper and 
lower respiratory tract of hamsters when introduced into wt 
PIV3 (see also, Skiadopoulos et al., J Virol 72 ; 1762, 1998). 
Their effect on the chimeric virus was evaluated by intranasal 

40 infection of hamsters, as shown in Table 16. These findings 

indicate that rPIV3-l . cp45L indeed was attenuated at both 
sites and, furthermore, that its level of attenuation was 
comparable to that of rPIV3cp45L. Thus, the ability of the 
cp45 L mutations to confer attenuation, like temperature 

45 sensitivity, is independent of the antigenic specificity of 

the surface glycoproteins. 
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Table 16. The recombinant chimeric rPIV3-l . cp45L candidate 
vaccine virus is attenuated in the respiratory tract of 
hamsters ® 



5 


Virus 


Virus titer in indicated tissue 
(logioTCIDso/giS.E) ^ 






Nasal turbinates 


Lungs 




rPIV3-l.cp45L 


4.6+0.3 


1.9±0.4 




rPIV3-l 


6.0±0.3 


6.3±0.4 




rPIV3cp45L 


3.0±0.3 


<1.2 


10 


rPIV3/JS 


5.7±0.3 


5.0±0.3 



* Groups of five hamsters were infected intranasally with 
indicated viruses at a dosage of 10^ PFU per hamster. On 
day 4 post infection, the tissue samples were harvested and 
15 assayed for virus. 

^ Virus titers are given as LogioTCIDso per gram of tissue. 



Infection with rPIV3-l or rPIV3-l .cp45L, containing 
20 the internal proteins of PIV3 and the glycoproteins of PIVl, 

confers resistance to PIVl challenge In hsunsters. 

The chimeric rPIV3-l virus and its attenuated 
rPIV3-l . cp45L derivative were evaluated for immunogenicity 
and protective efficacy in hamsters. As shown in Table 17, 

25 infection with either virus induced HAI antibodies against 

PIVl, but not PIV3, confirming that these chimeric viruses 
possess the PIVl HN glycoprotein and are highly immunogenic. 
The level of HAI antibodies induced by rPIV3-l . cp45L was 
two- fold less than that by rPIV3-l, which indicates that its 

30 attenuation resulted in a modest decrease in immunogenicity. 

Similarly, rPIV3 and rPIV3cp45L induced HAI antibodies 
against PIV3, but not PIVl, and the level induced by the 
attenuated virus was approximately two-fold lower. Despite 
the restricted replication in hamsters of the recombinant 

35 viruses bearing the cp45 L mutations, infection with either 

rPIV3cp45L or rPIV3-l . cp45L induced complete resistance to 
replication of challenge virus bearing homologous 
glycoproteins . 
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Infection with rPIV3-l .cp45L also confers resistance 

to PIV3, 

Information on the role of the non-HN or F 
glycoproteins of PIVs (i.e., the internal proteins) in 
5 resistance is limited. The disclosure and use of rPIV3-l and 

rPIV3-l . cp45L herein provides an opportunity to examine the 
role that internal proteins play in resistance to challenge 
with PIV3, since the only genes shared by immunizing and 
challenge viruses are the internal protein genes. PIV3 

10 challenge virus replication was significantly restricted in 

both the upper and lower respiratory tracts by prior infection 
of hamsters with rPIV3-l or rPIV3-l . cp45L (Table 17). Thus, 
these data indicate that the internal proteins of PIV3, like 
the HN and F proteins, are capable of inducing partial 

15 resistance to replication of the challenge PIV3 , 

Among the findings demonstrated by the immunogenicity 
and efficacy studies above a particularly unexpected finding 
was that infection with wt or attenuated rPIV3 induced a 
100-fold reduction in the replication of PIVl challenge virus 

20 in the lungs. Thus, infection with one serotype of PIV 

provided significant protection against a heterologous 
serotype. This was unexpected in part because previous 
studies indicated that infection of animals with one type of 
human PIV did not induce significant heterologous protection 

25 against a PIV belonging to a different human serotype, 

conforming to a general belief that immunity to human PIV 
infections was largely type-specific (see, eg., Cook et al., 
Amer. Jour. Hvg. 77 :150, 1963; Ray et al., J. Infect. Pis. 
162.: 746, 1990, each incorporated herein by reference in its 

30 entirety) . 

The present Example demonstrates successful 
exploitation of novel methods and reagents developed for 
generating PIV3 vaccines to further provide rapid, rational 
development of live attenuated candidate vaccines for PIVl . A 

35 cDNA encoding infectious PIV3 was modified by substitution of 

the ORFs encoding the PIVl HN and F protective antigens for 
their PIV3 counterparts. Subsequently, attenuating mutations, 
exemplified by three attenuating mutations present in the L 
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gene of the cp45 PIV3, were incorporated within this modified 
chimeric PIV3-PIV1 cDNA. From this cDNA, a recombinant virus 
was recovered bearing the HN and F genes of PIVl, the internal 
proteins of PIV3, and the PIV3 cp45 L gene mutations. This 
recombinant, rPIV3-l . cp45L, was temperature sensitive, highly 
attenuated in hamsters, and highly efficacious against PIVl 
challenge in hamsters. The level of temperature sensitivity, 
attenuation, and immunogenicity exhibited by rPIV3-l . cp45L was 
comparable to that of cp45 PIV3, indicating that the 
phenotypes specified by the set of cp45 L gene mutations are 
independent of the HN and F surface glycoproteins. These 
findings, which represent the first live attenuated PIVl 
vaccine candidate generated by reverse genetics, provide a 
generally successful scheme for developing vaccines against 
PIVl. 

Little information is known concerning the role that 
internal proteins of parainfluenza viruses play in resistance 
to reinfection with homologous virus. Infection with vaccinia 
recombinants expressing N, epitopes within N, or M reportedly 
induce resistance to replication of challenge virus, but the 
magnitude of the resistance reported is less than that induced 
by vaccinia recombinants bearing HN or F glycoproteins (see, 
eg., Kast et al., Proc. Natl. Acad. Sci. USA 88 :2283. 1991; 
Sakaguchi et al., J. Gen. Virol. 74 ; 479. 1993; Thomson et al., 
J. Immunol. 157 ; 822. 1996, each incorporated herein by 
reference in its entirety) . These studies suggested that the 
internal proteins were making only minor contributions to 
resistance to reinfection. Therefore, the present disclosure 
presents unexpected results by showing that prior infection of 
hamsters with rPIV3-l . cp45L or rPIV3-l induced about 250-to 
4 000 -fold reduction of replication of PIV3 in both the nasal 
turbinates and lungs. These two chimeric recombinant viruses 
differ from the PIV3 challenge virus in that they possess the 
HN and F glycoproteins of PIVl rather than PIV3 , but they 
share all other genes with the challenge virus. The HN and F 
glycoproteins of PIVl share 47% and 43% sequence identity with 
those of PIV3, respectively. Although it is likely that the 
shared internal proteins are mediating the observed 
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resistance, it is also possible that shared protein sequences 
between PI VI and PIV3 F and HN glycoproteins are contributing 
to the observed immunity. For example, there are 5 stretches 
in HN and 2 stretches in F extending at least 9 amino acid 
5 residues in length that are shared between PIVl and PIV3 and 

have the potential to act as protective CTL epitopes. It is 
reasonable to consider that the shared internal proteins are 
contributing to the restriction of replication of wt PIV3 
challenge virus, since this level of cross -immunity has not 

10 been seen in previous studies (see, eg., Cook et al., Amer . 

Jour. Hyg. 77 :150. 1963; Ray et al., J. Infect. Pis. 162 :746, 
1990, incorporated herein by reference in its entirety) . 

The finding that the internal PIV3 proteins of the 
rPIV3-l and rPIV3 -1 . cp45L chimeras conferred resistance to 

15 PIV3 challenge demonstrates that attenuatfed derivatives of 

PIV3 can be used as vectors for PIVl and PIV2 protective 
antigens. Following the teachings of the invention, 
immunization with one PIV3 -based live-attenuated vaccine virus 
can restrict the replication of other PIV3 -based vaccine 

20 viruses administered subsequently, thereby decreasing the 

immunogenicity of the second virus. Since PIV3, like RSV, 
induces significant illness in early infancy, a combined 
RSV-PIV3 vaccine for use in the very young 2- to 4 -week old 
infant is therefore an important aspect of the invention (see, 

25 eg., Collins et al.. Fields Virolocrv 3rd ed. Philadelphia: 

Lippincott -Raven Publishers, 1205(1), 1996; Reed et al., J. 
Infect. Pis. 175 :807. 1997, each incorporated herein by 
reference in its entirety) . According to this aspect of the 
invention, immunization with a PIV1-PIV2 vaccine will be 

30 preferably initiated at about 6 months of age, since most PIVl 

and PIV2 disease occurs after the age of six months. In the 
possible circumstance that immunization with rPIV3 cp45 
significantly inhibits replication of a chimeric recombinant 
PIV3-1 vaccine virus with which it shares internal protein 

35 genes, successful immunization with a recombinant PIV3-1 

vaccine may be compromised. In this event, a trivalent PIV 
vaccine will be administered simultaneously rather than 
sequentially, thereby preventing the above noted inhibition. 
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The disclosure herein that infection with a vaccine 
or wt PIV3 would induce a 100 -fold reduction of pulmonary 
virus replication of the heterologous wt PIVl was clearly 
unexpected, in part because the human PIV viruses are 
serologically distinct by neutralization assay, and previous 
studies in hamsters found that prior infection with one type 
of PIV failed to induce resistance to challenge with a high 
dose of a different PIV type (see eg., Cook et al., Amer. 
Jour. Hyq. 77 ; 150, 1963; Ray et al., J. Infect. Pis. 162 ; 746, 
1990; Cook et al., Amer. Jour. Hvq. 69 ; 250, 1959). 
Furthermore, there is little epidemiological data documenting 
that prior infection with one PIV significantly modifies 
subsequent infection with a heterotypic PIV. 

In summary, the present Example shows that rPIVB was 
successfully converted into a vaccine for PIVl by substituting 
the ORFs encoding the F and HN glycoproteins and introducing 
known attenuating mutations into the PIV3 internal genes. 
Thus, the extensive methods and reagents provided herein can 
be applied directly and predictably to attenuating the PIV3 
backbone of the rPIV3-l chimeric virus, as well as for 
generating live-attenuated PIV2 vaccine viruses. 

The foregoing disclosure makes it possible to exploit 
the reagents and methods provided herein to develop a broad 
assemblage of PIV and related vaccines. In this context, 
recovery of live, immunogenic chimeras between PIV3 and PIV2 
exemplifies powerful new tools for developing a range of 
recombinant PIV viruses for vaccine use. In conjunction with 
this work, identification and characterization of the genetic 
basis for attenuation of naturally occurring PIV mutants, 
e.g., cp45 and BPIV3 vaccine candidates, following the 
teachings of the present disclosure also enables development 
of a large host of recombinant vaccine viruses and subviral 
particles. In particular, desired mutations present in 
biologically derived mutant viruses will be readily identified 
and selected by their introduction, singly and in combination, 
into a wild type, partially attenuated, or chimeric PIV 
background, as shown in the Examples above. These findings 
will expand the menu of exemplary, attenuating mutations 
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within the invention which can introduced into PIV clones to 
calibrate the level of attenuation and immunogenicity in 
vaccine recombinants. Biologically derived mutations can also 
be introduced within PIV clones having different types of 
mutations, e.g., mutations involving alterations, deletions, 
or substitutions of a gene or gene segment. Within this 
aspect of the invention, recombinant PIV are provided which 
have a selected gene deletion, addition, or substitution, such 
as rPIV having a deletion of the C, D or V ORF(s) . Such 
alternatively mutated clones can be further modified according 
to the present disclosure by introducing one or more mutations 
specifying a ts, ca or att phenotype adopted from a 
biologically derived mutant PIV, as exemplified by the PIV 
recombinants r942,. r992, rl558, r942/992, r992/l558, or 
r942/l558, and r942/992/l558 . In additional aspects of the 
invention, biologically derived mutations will be combined 
with de novo attenuating mutations not found in nature, as 
exemplified by attenuating gene deletions, e.g., of the C, D 
and/or V ORFs. Other types of mutations disclosed herein 
conferring desired phenotypic characteristics will also be 
combined with biologically derived, attenuating mutations, 
similar to the range of combinatorial mutations disclosed for 
recombinant RSV vaccine strains in United States Patent 
Application Serial No. 08/892,403, filed July 15, 1997 
(incorporated herein by reference) . Comparable mutations can 
be readily introduced, e.g., into a chimeric virus, to achieve 
a desired levels of attenuation and immunogenicity in a 
chimeric vaccine strain. In this manner, a large menu of 
mutations are provided within the invention that are useful to 
engineer a wide assemblage of live attenuated rPIV vaccines 
having a desired balance of attenuation and immunogenicity, 
along with other desired phenotypic characteristics. 

Although the foregoing invention has been described 
in some detail by way of illustration and example for purposes 
of clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the scope of 
the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: The Government of the United States 

(B) STREET: Box OTT 

(C) CITY: Bethesda 

(D) STATE: Maryland 

(E) COUNTRY: United States of America 

(F) POSTAL CODE (ZIP) : 20892 

(G) TELEPHONE: 

(H) TELEFAX: 

(I) TELEX: 



(ii) TITLE OF INVENTION: PRODUCTION OF ATTENUATED PARAINFLUENZA 
VIRUS VACCINES FROM CLONED NUCLEOTIDE SEQUENCE 

(iii) NUMBER OF SEQUENCES: 74 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 

(B) FILING DATE: 22-MAY-1998 

(C) CL;\SSIFICATI0N: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/059,385 

(B) FILING DATE: 19-SEP-1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/047,575 

(B) FILING DATE: 23-MAY-1997 

(Vii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: King, Jeffrey J. 

(B) REGISTRATION NUMBER: 38,515 

(C) REFERENCE/DOCKET NUMBER: 17634-000320PC 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-467-9600 

(B) TELEFAX: 415-576-0300 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GCTGGGTACC GGGCCCGTCG ACGCGTATAT AGTTCCTCCT TTCAGCAAAA AACCCCTCAA 60 
GACCCGTTTA GAGGCCCCAA GGGGTTATGC TACTGCAGGC TCTCCCTTAG CCATCCGAGT 120 
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GGACGTGCX3T CCTCCTTCGG ATGCCCAGGT CGGACCGCGA GGAGGTGGAG ATGCCATGCC 180 

GACCCACCAA ACAAGAGAAG AAACTTGTCT GGGTVATATAA ATTTAACTTT AAATTAACTT 240 

AGGATTAAAG ACATTGACTA GAAGGTCAAG AAAAGGGAAC TCTATAATTT CAAAAATGTT 300 

GAGCCTATTT GATACATTTA ATGCACGTAG GCAAGAAAAC ATAACAAAAT CAGCCGGTGG 360 

AGCTATCATT CCTGGACAGA AAAATACTGT CTCTATATTC GCCCTTGGAC CGACAATAAC 420 

TGATGATAAT GAGAAAATGA CATTAGCTCT TCTATTTCTA TCTCATTCAC TAGATAATGA 480 

GAAACAACAT GCACAAAGGG CAGGGTTCTT GGTGTCTTTA TTGTCAATGG CTTATGCCAA 540 

TCCAGAGCTC TACCTAACAA CAAATGGAAG TAATGCAGAT GTCAAGTATG TCATATACAT 600 

GATTGAGAAA GATCTAAAAC GGCAAAAGTA TGGAGGATTT GTGGTTAAGA CGAGAGAGAT 660 

GATATATGAA AAGACAACTG ATTGGATATT TGGAAGTGAC CTGGATTATG ATCAGGAAAC 720 

TATGTTGCAG AACGGCAGGA ACAATTCAAC AATTGAAGAC CTTGTCCACA CATTTGGGTA 780 

TCCATCATGT TTAGGAGCTC TTATAATACA GATCTGGATA GTTCTGGTCA AAGCTATCAC 840 

TAGTATCTCA GGGTTAAGAA AAGGCTTTTT CACCCGATTG GAAGCTTTCA GACAAGATGG 900 

AACAGTGCAG GCAGGGCTGG TATTGAGCGG TGACACAGTG GATCAGATTG GGTCAATCAT 960 

GCGGTCTCAA CAGAGCTTGG TAACTCTTAT GGTTGAAACA TTAATAACAA TGAATACCAG 1020 

CAGAAATGAC CTCACAACCA TAGTUVAAGAA TATACAAATT GTTGGCAACT ACATAAGAGA 1080 

TGCAGGTCTC GCTTCATTCT TCAATACAAT CAGATATGGA ATTGAGACCA GAATGGCAGC 1140 

TTTGACTCTA TCCACTCTCA GACCAGATAT CAATAGATTA AAAGCTTTGA TGGAACTGTA 1200 

TTTATCAAAG GGACCACGCX3 CTCCTTTCAT CTGTATCCTC AGAGATCCTA TACATGGTGA 1260 

GTTCGCACCA GGCAACTATC CTGCCATATG GAGCTATGCA ATGGGGGTGG CAGTTGTACA 1320 

AAATAGAGCC ATGCAACAGT ATGTGACGGG AAGATCATAT CTAGACATTG ATATGTTCCA 1380 

GCTAGGACAA GCAGTAGCAC GTGATGCCGA AGCTCAAATG AGCTCAACAC TGGAAGATGA 1440 

ACTTGGAGTG ACACACGAAT CTAAAGAAAG CTTGAAGAGA CATATAAGGA ACATAAACAG 1500 

TTCAGAGACA TCTTTCCACA AACCGACAGG TGGATCAGCC ATAGAGATGG CAATAGATGA 1560 

AGAGCCAGAA CAATTCGAAC ATAGAGCAGA TCAAGAACAA AATGGAGAAC CTCAATCATC 1620 

CATAATTCAA TATGCCTGGG CAGAAGGAAA TAGAAGCGAT GATCAGACTG AGCAAGCTAC 1680 

AGAATCTGAC AATATCAAGA CCGAACAACA AAACATCAGA GACAGACTAA ACAAGAGACT 1740 

CAACGACAAG AAGAAACAAA GCAGTCAACC ACCCACTAAT CCCACAAACA GAACAAACCA 1800 

GGACGAAATA GATGATCTGT TTAACGCATT TGGAAGCAAC TAATCGAATC AACATTTTAA 1860 

TCTAAATCAA TAATAAATAA GAAAAACTTA GGATTAAAGA ATCCTATCAT ACCGGAATAT 1920 

AGGGTGGTAA ATTTAGAGTC TGCTTGAAAC TCAATCAATA GAGAGTTGAT GGAAAGCGAT 1980 

GCTAAAAACT ATCAAATCAT GGATTCTTGG GAAGAGGAAT CAAGAGATAA ATCAACTAAT 2040 

ATCTCCTCGG CCCTCAACAT CATTGAATTC ATACTCAGCA CCGACCCCCA AGAAGACTTA 2100 

TCGGAAAACG ACACAATCAA CACAAGAACC CAGCAACTCA GTGCCACCAT CTGTCAACCA 2160 

GAAATCAAAC CAACAGAAAC AAGTGA6AAA GATA6TGGAT CAACTGACAA AAATAGACAG 2220 



wo 98/53078 PCT/US98/10551 

131 

TCTGGGTCAT CACACGAATG TACAACAGAA GCAAAAGATA GAAATATTGA TCAGGAAACT 2280 

GTACAGAGAG GACCTGGGAG AAGAAGCAGC TCAGATAGTA GAGCTGAGAC T6TGGTCTCT 2340 

GGAGGAATCC CCAGAAGCAT CACAGATTCT AAAAATGGAA CCCAAAACAC GGAGGATATT 2400 

GATCTCAATG AAATTAGAAA GATGGATAAG GACTCTATTG AGGGGAAAAT GCGACAATCT 2460 

GCAAATGTTC CAAGCGAGAT ATCA6G7VAGT 6ATGACATAT TTACAACAGA ACAAAGTAGA 2520 

AACAGTGATC ATGGAAGAAG CCTGGAATCT ATCAGTACAC CTGATACAAG ATCAATAAGT 2580 

GTTGTTACTG CTGCAACACC AGATGATGJUV GAAGAAATAC TAATGAAAAA TAGTAGGACA 2640 

AAGAA7U\GTT CTTCAACACA TCAAG7UVGAT GACAAAAGTA TTAAAAAAGG GGGAAAAGGG 2700 

AAAGACTGGT TTAAGAAATC AAAAGATACC GACAACCAGA TACCAACATC AGACTACAGA 2760 

TCCACATCAA AAGGGCAGAA GAAAATCTCA AAGACAACAA CCACCAACAC CGACACAAAG 2820 

GGGCAAACAG AAATACAGAC AGAATCATCA GAAACACAAT CCTCATCATG GAATCTCATC 2880 

ATCGACAACA ACACCGACCG GAACGAACAG ACAAGCACAA CTCCTCCAAC AACAACTTCC 2940 

AGATCAACTT ATACAAAAGA ATCGATCCGA ACAAACTCTG AATCCAAACC CAAGACACAA 3000 

AAGACAAATG GAAAGGAAAG GAAGGATACA GAAGAGAGCA ATCGATTTAC AGAGAGGGCA 3060 

ATTACTCTAT TGCAGAATCT TGGTGTAATT CAATCCACAT CAAAACTAGA TTTATATCAA 3120 

GACAAACGAG TTGTATGTGT AGCAAATGTA CTAAACAATG TAGATACTGC ATCAAAGATA 3180 

GATTTCCTGG CAGGATTAGT CATAGGGGTT TCAATGGACA ACGACACAAA ATTAACACAG 3240 

ATACAAAATG AAATGCTAAA CCTCAAAGCA GATCTAAAGA AAATGGACGA ATCACATAGA 3300 

AGATTGATAG AAAATCAAAG AGAACAACTG TCATTGATCA CGTCACTAAT TTCAAATCTC 3360 

AAAATTATGA CTGAGAGAGG AGGAAAGAAA GACCAAAATG AATCCAATGA GAGAGTATCC 3420 

ATGATCAAAA CAAAATTGAA AGAAGAAAAG ATCAAGAAGA CCAGGTTTGA CCCACTTATG 3480 

GAGGCACAAG GCATTGACAA GAATATACCC GATCTATATC GACATGCAGG AGATACACTA 3540 

GA6AACGATG TACAAGTTAA ATCAGAGATA TTAAGTTCAT ACAATGAGTC AAATGCAACA 3600 

AGACTAATAC CCAAAAAAGT GAGCAGTACA ATGAGATCAC TAGTTGCAGT CATCAACAAC 3660 

AGCAATCTCT CACAAAGCAC AAAACAATCA TACATAAACG AACTCAAACG TTGCAAAAAT 3720 

GATGAAGAAG TATCTGAATT AATGGACATG TTCAATGAAG ATGTCAACAA TTGCC7UVTGA 3780 

TCCAACAAAG AAACGACACC GAACAAACAG ACAAGAAACA ACAGTAGATC AAAACCTGTC 3840 

AACACACACA AAATCAAGCA GAATGAAACA ACAGATATCA ATCAATATAC AAATAAGAAA 3900 

AACTTAGGAT TAAAGAATAA ATTAATCCTT GTCCAAAATG AGTATAACTA ACTCTGCAAT 3960 

ATACACATTC CCAGAATCAT CATTCTCTGA AAATGGTCAT ATAGAACCAT TACCACTCAA 4020 

AGTCAATGAA CAGAGGAAAG CAGTACCCCA CATTAGAGTT GCCAAGATCG GAAATCCACC 4080 

AAAACACGGA TCCCGGTATT TAGATGTCTT CTTACTCGGC TTCTTCGAGA TGGAACGAAT 4140 

CAAAGACAAA TACGGGAGTG TGAATGATCT CGACAGTGAC CCGAGTTACA AAGTTTGTGG 4200 
CTCTGGATCA TTACCJUVTCG GATTGGCTAA GTACACTGGG AATGACCAGG AATTGTTACA 4260 
AGCCGCAACC AAACT6GATA TAGAAGTGAG AAGAACAGTC AAAGCGAAAG AGATGGTTGT 4320 
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TTACACGGTA CAAAATATAA AACCAGAACT GTACCCATGG TCCAATAGAC TAAGAAAAGG 4380 

AATGCTGTTC GATGCCAACA AAGTTGCTCT T6CTCCTCAA TGTCTTCCAC TAGATAGGAG 4440 

CATAAAATTT AGAGTAATCT TCGTGAATTG TACGGCAATT GGATCAATAA CCTTGTTCAA 4500 

AATTCCTAAG TCAATGGCAT CACTATCTCT ACCCAACACA ATATCAATCA ATCTGCAGGT 4560 

ACACATAAAA ACAGGGGTTC AGACTGATTC TA;U\GGGATA GTTCAAATTT TGGATGAGAA 4620 

AGGCGAAAAA TCACTG7UVTT TCATGGTCCA TCTCGGATTG ATCAAAAGAA AAGTAGGCAG 4680 

AATGTACTCT GTTGAATACT GTAAACAGAA AATCGAGAJ\A ATGAGATTGA TATTTTCTTT 4740 

AGGACTAGTT GGAGGAATCA GTCTTCATGT CAATGCAACT GGGTCCATAT CAAAAACACT 4800 

AGCAAGTCAG CTGGTATTCA AAAGAGAGAT TTGTTATCCT TTAATGGATC TAAATCCGCA 4860 

TCTCAATCTA GTTATCTGGG CTTCATCAGT AGAGATTACA AGAGTGGATG CAATTTTCCA 4920 

ACCTTCTTTA CCTGGCGAGT TCAGATACTA TCCTAATATT ATTGCAAAAG GAGTTGGGAA 4980 

AATCAAACAA TGGAACTAGT AATCTCTATT TTAGTCCGGA CGTATCTATT AAGCCGAAGC 5040 

AAATAAAGGA TAATCAAAAA CTTAGGACAA AAGAGGTCAA TACCAACAAC TATTAGCAGT 5100 

CACACTCGCA AGAATAAGAG AGAAGGGACC AAAAAAGTCA AATAGGAGAA ATCAAAAGAA 5160 

AAGGTACAGA ACACCAGAAC AAGAAAATCA AAACATCCAA CTCACTCAAA ACAAAAATTC 5220 

CAA7JVGAGAC CGGCAACACA ACAAGCACTG AACACAATGC CAACTTCAAT ACTGCTAATT 5280 

ATTACAACCA TGATCATGGC ATCTTTCTGC CAAATAGATA TCACAAAACT ACAGCACGTA 5340 

GGTGTATTGG TCAACAGTCC CAAAGGGATG AAGATATCAC AAAACTTTGA AACAAGATAT 5400 

CTAATTTT6A GCCTCATACC A7UUUVTAGAA GACTCTAACT CTTGTGGTGA CCAACAGATC 5460 

AAGCAATACA AGAAGTTATT GGATAGACTG ATCATCCCTT TATATGATGG ATTAAGATTA 5520 

CAGAAAGATG TGATAGTAAC CAATCAAGAA TCCAATGAAA ACACTGATCC CAGAACAAAA 5580 

CGATTCTTTG 6AGGGGTAAT TGGAACCATT GCTCTGGGAG TAGCAACCTC AGCACAAATT 5640 

ACAGCGGCAG TTGCTCTGGT TGAAGCCAAG CAGGCAAGAT CAGACATCGA AAAACTCAAA 5700 

GAAGCAATTA GGGACACAAA CAAAGCAGTG CAGTCAGTTC AGAGCTCCAT AGGAAATTTA 5760 

ATAGTAGCAA TTAAATCAGT CCAGGATTAT GTTAACAAAG AAATCGTGCC ATCGATTGCG 5820 

AGGCTAGGTT GTGAAGCAGC AGGACTTCAA TTAGGAATTG CATTAACACA GCATTACTCA 5880 

GAATTAACAA ACATATTTGG TGATAACATA GGATCGTTAC AAGAAAAAGG AATAAAATTA 5940 

CAAGGTATAG CATCATTATA CCGCACAAAT ATCACAGAAA TATTCACAAC ATCAACAGTT 6000 

GATAAATATG ATATCTATGA TCTGTTATTT ACAGAATCAA TAAAGGTGAG AGTTATAGAT 6060 

GTTGACTTGA ATGATTACTC AATCACCCTC CAAGTCAGAC TCCCTTTATT AACTAGGCTG 6120 

CTGAACACTC AGATCTACAA AGTAGATTCC ATATCATATA ACATCCAAAA CAGAGAATGG 6180 

TATATCCCTC TTCCCAGCCA TATCATGACG AAAGGGGCAT TTCTAGGTGG AGCAGACGTC 6240 

AAAGAATGTA TAGAAGCATT CAGCAGCTAT ATATGCCCTT CTGATCCAGG ATTTGTATTA 6300 

AACCATGAAA TAGAGAGCTG CTTATCAGGA AACATATCCC AATGTCCAAG AACAACGGTC 6360 

ACATCAGACA TTGTTCCAAG ATATGCATTT GTCAATGGAG GAGTGGTTGC AAACTGTATA 6420 
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ACAACCACCT 


GTACATGCAA 


CGGAATTGGT AATAGAATCA ATCAACCACC 


TGATCAAGGA 


6480 


GTAAAAATTA 


TAACACATAA AGAAT6TAGT ACAATAGGTA 


TCAACGGAAT 


GCTGTTCAAT 


6540 


ACAAATAAAG 


AAGGAACTCT 


TGCATTCTAT ACACCAZUVTG 


ATATAACACT 


AAACAATTCT 


6600 


GTTGCACTTG 


ATCCAATTGA CATATCAATC GAGCTCAACA AGGCCAAATC 


AGATCTAGAA 


6660 


GAATCAAAAG 


AATGGATAAG 


AAGGTCAAAT CAAAAACTAG 


ATTCTATTGG 


AAATTGGCAT 


6720 


CAATCTAGCA 


CTACAATCAT 


AATTATTTTG ATAATGATCA 


TTATATT6TT 


TATAATTAAT 


6780 


ATAACGATAA 


TTACAATTGC 


AATTAAGTAT TACAGAATTC 


AAAAGAGAAA 


TCGAGTGGAT 


6840 


CAAAATGACA 


AGCCATATGT 


ACTAACAAAC AAATAACATA 


TCTACAGATC 


ATTAGATATT 


6900 


AAAATTATAA 


AAAACTTAGG 


AGTAAAGTTA CGCAATCCAA 


CTCTACTCAT ATAATTGAGG 


6960 


AAGGACCCAA 


TAGACAAATC 


CAAATTCGAG ATGGAATACT 


GGAAGCATAC 


CAATCACGGA 


7020 


AAGGATGCTG 


GTAATGAGCT 


GGAGACGTCT ATGGCTACTC 


ATGGCAACAA 


GCTCACTAAT 


7080 


AAGATAATAT 


ACATATTATG 


GACAATAATC CTGGTGTTAT 


TATCAATAGT 


CTTCATCATA 


7140 


GTGCTAATTA 


ATTCCATCAA AAGTGAAAAG GCCCACGAAT 


CATTGCTGCA 


AGACATAAAT 


7200 


AATGAGTTTA 


TGGAAATTAC 


AGAAAAGATC CAAATG6CAT 


CGGATAATAC 


CAATGATCTA 


7260 


ATACAGTCAG 


6AGTGAATAC 


AAGGCTTCTT ACAATTCAGA 


GTCATGTCCA 


GAATTACATA 


7320 


CCAATATCAT 


TGACACAACA 


GATGTCAGAT CTTAGGAAAT 


TCATTAGTGA 


AATTACT^TT 


7380 


AGAAAT6ATA 


ATCAAGAAGT 


GCTGCCACAA AGAATAACAC 


ATGATGTAGG 


TATAAAACCT 


7440 


TTAAATCCAG 


ATGATTTTTG 


GAGATGCACG TCTGGTCTTC 


CATCTTTAAT 


GAAAACTCCA 


7500 


AAAATAAGGT 


TAATGCCAGG 


GCCGGGATTA TTAGCTATGC 


CAACGACTGT 


TGATGGCTGT 


7560 


GTTAGAACTC 


CGTCTTTAGT 


TATAAATGAT CTGATTTATG 


CTTATACCTC 


AAATCTAATT 


7620 


ACTCGAGGTT 


GTCAGGATAT 


AGGAAAATCA TATCAAGTCT 


TACAGATAGG 


GATAATAACT 


7680 


GTAAACTCAG 


ACTTGGTACC 


TGACTTAAAT CCTAGGATCT 


CTCATACCTT 


TAACATAAAT 


7740 


GACAATAGGA 


AGTCATGTTC 


TCTAGCACTC CTAAATATAG 


ATGTATATCA 


ACTGTGTTCA 


7800 


ACTCCCAAAG 


TTGATGAAAG 


ATCAGATTAT 6CATCATCAG 


GCATAGAAGA 


TATTGTACTT 


7860 


GATATTGTCA 


ATTATGATGG 


TTCAATCTCA ACAACAAGAT 


TTAAGAATAA 


TAACATAAGC 


7920 


TTTGATCAAC 


CATATGCTGC 


ACTATACCCA TCTGTTGGAC 


CAGGGATATA 


CTACAAAGGC 


7980 


AAAATAATAT 


TTCTCGGGTA 


TGGAGGTCTT GAACATCCAA 


TAAATGAGAA 


TGTAATCTGC 


8040 


AACACAACTG 


6GTGCCCCGG 


GAAAACACAG AGAGACTGTA 


ATCAAGCATC 


TCATAGTACT 


8100 


TGGTTTTCAG 


ATAGGAGGAT 


GGTCAACTCC ATCATTGTTG 


TTGACAAAGG 


CTTAAACTCA 


8160 


ATTCCAAAAT 


TGAAAGTATG 


GACGATATCT ATGCGACAAA 


ATTACTGGGG 


GTCAGAAGGA 


8220 


AGGTTACTTC 


TACTAGGTAA 


CAAGATCTAT ATATATACAA GATCTACAAG 


TTGGCATAGC 


8280 


AAGTTACAAT 


TAGGAATAAT 


TGATATTACT GATTACAGTG 


ATATAAGGAT 


AAAATGGACA 


8340 


TGGCATAATG 


TGCTATCAAG 


ACCAGGAAAC AATGAATGTC 


CATGGGGACA 


TTCATGTCCA 


8400 


GATGGATGTA 


TAACAGGAGT 


ATATACTGAT GCATATCCAC 


TCAATCCCAC 


AGGGAGCATT 


8460 


GTGTCATCTG 


TCATATTAGA 


CTCACAAAAA TCX3AGAGTGA 


ACCCAGTCAT 


AACTTACTCA 


8520 
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ACAGCAACCG 


AAAGAGTAAA 


CGAGCTGGCC 


ATCCTAAACA GAACACTCTC . 


AGCTGGATAT 


8580 


ACAACAACAA 


GCTGCATTAC 


ACACTATAAC 


TUU^GGATATT GTTTTCATAT 


AGTAGAAATA 


8640 


AATCATATU^ 


GCTTAAACAC 


ATTTCAACCC 


ATGTTGTTCA AAACAGAGAT 


TCCAAA7UVGC 


8700 


TGCAGTTAAT 


CATAATTAAC 


CATAATATGC ATCAATCTAT CTATAATACA AGTATATGAT 


8760 


AAGTAATCAG 


CAATCAGACA 


ATAGACAAAA 


GGGAAATATA AAAAACTTAG 


GAGCAAAGCG 


8820 


TGCTCGGGAA ATGGACACTG 


AATCTAACAA 


TGGCACTGTA TCTGACATAC 


TCTATCCTGA 


8880 


GTGTCACCTT 


AACTCTCCTA 


TCGTTAAAGG 


TAAAATAGCA CAATTACACA 


CTATTATGAG 


8940 


TCTACCTCAG 


CCTTATGATA 


TGGATGACGA 


CTCAATACTA GTTATCACTA 


GACAGAAAAT 


9000 


AAAACTTAAT 


AAATTG6ATA 


AAAGACAACG 


ATCTATTAGA AGATTAAAAT 


TAATATTAAC 


9060 


TGAAAAAGTG 


AATGACTTAG 


GAAAATACAC 


ATTTATCAGA TATCCAGAAA 


TGTCAT^GA 


9120 


AATGTTCAAA 


TTATATATAC 


CTGGTATTAA 


CAGTAAAGTG ACTGAATTAT 


TACTTAAAGC 


9180 


AGATAGAACA 


TATAGTCMA 


TGACTGATGG 


ATTAAGAGAT CTATGGATTA ATGTGCTATC 


9240 


AAAATTAGCC 


TCAAAAAATG 


ATGGAAGCAA 


TTATGATCTT AATGAAGAAA 


TTAATAATAT 


9300 


ATCGAAAGTT 


CACACAACCT 


ATAAATCAGA 


TAAATGGTAT AATCCATTCA 


AAACATGGTT 


9360 


TACTATCAAG 


TATGATATGA 


GAAGATTACA 


AAAAGCTCGA AATGAGATCA 


CTTTTAATGT 


9420 


TGGGAAGGAT 


TATAACTTGT 


TAGAAGACCA 


GAAGAATTTC TTATTGATAC 


ATCCAGAATT 


9480 


GGTTTTGATA 


TTAGATAAAC 


AAAACTAT7UV 


TGGTTATCTA ATTACTCCTG 


AATTAGTATT 


9540 


GATGTATTGT 


GACGTAGTCG 


AAGGCCGATG 


GAATATAAGT GCATGTGCTA 


AGTTAGATCC 


9600 


AAAATTACAA 


TCTATGTATC 


AGAAAGGTAA 


TAACCTGTGG GAAGTGATAG 


ATAAATTGTT 


9660 


TCCAATTATG 


GGAGAAAAGA 


CATTTGATGT 


GATATCGTTA TTAGAACCAC 


TTGCATTATC 


9720 


CTTAATTCAA 


ACTCATGATC 


CTGTTAAACA 


ACTAAGAGGA GCTTTTTTAA 


ATCATGTGTT 


9780 


ATCC6AGATG 


GAATTAATAT 


TTGAATCTAG 


AGAATCGATT AAGGAATTTC 


TGAGTGTAGA 


9840 


TTACATTGAT 


AAAATTTTAG 


ATATATTTAA 


TAAGTCTACA ATAGATGAAA 


TAGCAGAGAT 


9900 


TTTCTCTTTT 


TTTAGAACAT 


TTGGGCATCC 


TCCATTAGAA GCTAGTATTG 


CAGCAGAAAA 


9960 


GGTTAGAAAA TATATGTATA TTGGAAAACA ATTAAAATTT GACACTATTA ATAAATGTCA 


10020 


TGCTATCTTC 


TGTACAATAA 


TAATTAACGG 


ATATAGAGAG AGGCATGGTG 


GACAGTGGCC 


10080 


TCCTGTGACA 


TTACCTGATC 


ATGCACACGA ATTCATCATA AATGCTTACG 


GTTCAAACTC 


10140 


TGCGATATCA 


TATGAAAATG 


CTGTTGATTA 


TTACCAGAGC TTTATAGGAA 


TAAAATTCAA 


10200 


TAAATTCATA 


GAGCCTCAGT 


TAGATGAGGA 


TTTGACAATT TATATGAAAG 


ATAAAGCATT 


10260 


ATCTCCAAAA 


AAATCAAATT 


GGGACACAGT 


TTATCCTGCA TCTAATTTAC 


TGTACCGTAC 


10320 


TAACGCATCC 


AACGAATCAC 


GAAGATTAGT 


TGT^GTATTT ATAGCAGATA 


GTAAATTTGA 


10380 


TCCTCATCAG 


ATATTGGATT 


ATGTAGAATC 


TGGGGACTGG TTAGATGATC 


CAGAATTTAA 


10440 


TATTTCTTAT 


AGTCTTAAAG 


AAAAAGAGAT 


CAAACAGGAA GGTAGACTCT 


TTGCAAAAAT 


10500 


GACATACAAA 


ATGAGAGCTA 


CACAAGTTTT 


ATCAGAGACC CTACTTGCAA 


ATAACATAGG 


10560 


TUJUITTCTTT 


CAAGAAAATG 


GGATGGTGAA 


GGGAGAGATT GAATTACTTA 


AGAGATTAAC 


10620 
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AACCATATCA 


ATATCAGGAG 


TTCCACGGTA 


TAATGAAGTG TACAATAATT 


CTAAAAGCCA 


10680 


TACAGATGAC 


CTTAAAACCT 


ACAATAAAAT 


AAGTAATCTT AATTTGTCTT 


CTAATCAGAA 


10740 


ATCAAAGAAA 


TTTGAATTCA 


AGTCAACGGA 


TATCTACAAT GATGGATACG 


AGACTGTGAG 


10800 


CTGTTTCCTA 


ACAACAGATC 


TCAAAAAATA 


CTGTCTTAAT TGGAGATATG 


AATCAACAGC 


10860 


TCTATTTGGA 


GAAACTTGCA 


ACCAAATATT 


TGGATTAAAT AAATTGTTTA ATTGGTTACA 


10920 


CCCTCGTCTT 


GAAGGAAGTA 


CAATCTATGT 


AGGTGATCCT TACTGTCCTC 


CATCAGATAA 


10980 


AGAACATATA 


TCATTAGAGG 


ATCACCCTGA 


TTCTGGTTTT TACGTTCATA 


ACCCAAGAGG 


11040 


GGGTATAGAA 


GGATTTTGTC 


AAAAATTATG 


GACACTCATA TCTATAAGTG 


CAATACATCT 


11100 


AGCAGCTGTT 


AGAATAGGCG 


TGAGGGTGAC 


TGCAATGGTT CAAGGAGACA ATCAAGCTAT 


11160 


AGCTGTAACC 


ACAAGAGTAC 


CCAACAATTA 


TGACTACAGA GTTAAGAAGG 


AGATAGTTTA 


11220 


TAAAGATGTA 


GTGAGATTTT 


TTGATTCATT 


AAGAGAAGTG ATGGATGATC 


TAGGTCATGA 


11280 


ACTTAAATTA 


AATGAAACGA 


TTATAAGTAG 


CAAGATGTTC ATATATAGCA AAAGAATCTA 


11340 


TTATGATGGG 


AGAATTCTTC 


CTCAAGCTCT 


AAAAGCATTA TCTAGATGTG 


TCTTCTGGTC 


11400 


AGAGACAGTA 


ATAGACGAAA 


CAAGATCAGC 


ATCTTCAAAT TTGGCAACAT 


CATTTGCAAA 


11460 


AGCAATTGAG 


AATGGTTATT 


CACCTGTTCT 


AGGATATGCA TGCTCAATTT 


TTAAGAATAT 


11520 


TC7VACAACTA 


TATATTGCCC 


TTGGGATGAA 


TATCAATCCA ACTATAACAC 


AGAATATCAG 


11580 


AGATCAGTAT 


TTTAGGAATC 


CAAATTGGAT 


GCAATATGCC TCTTTTU^TAC 


CTGCTAGTGT 


11640 


TGGGGGATTC 


AATTACATGG 


CCATGTCAAG 


ATGTTTTGTA AGGAATATTG 


GTGATCCATC 


11700 


AGTTGCCGCA 


TTGGCTGATA 


TTAAAAGATT 


TATTAAGGCG AATCTATTAG 


ACCGAAGTGT 


11760 


TCTTTATAGG 


ATTATGAATC 


AAGAACCAGG 


TGAGTCATCT TTTTTGGACT 


GGGCTTCAGA 


11820 


TCCATATTCA 


TGCAATTTAC 


CACAATCTCA 


AAATATAACC ACCATGATAA AAAATATAAC 


11880 


AGCAAGGAAT 


GTATTACAAG 


ATTCACCAAA 


TCCATTATTA TCTGGATTAT 


TCACAAATAC 


11940 


AATGATAGAA 


GAAGATGAAG 


AATTAGCTGA 


GTTCCTGATG GACAGGT^GG 


TAATTCTCCC 


12000 


TAGAGTTGCA 


CATGATATTC 


TAGATAATTC 


TCTCACAGGA ATTAGAAATG 


CCATAGCTGG 


12060 


AATGTTAGAT 


ACGACAAAAT 


CACTAATTCG 


GGTT6GCATA AATAGAGGAG 


GACTGACATA 


12120 


TAGTTTGTTG 


AGGAAAATCA 


GTAATTACGA 


TCTAGTACAA TATGAAACAC 


TAAGTAGGAC 


12180 


TTTGCGACTA 


ATTGTAAGTG 


ATAAAATCAA 


GTATGAAGAT AT6TGTTCGG 


TAGACCTTGC 


12240 


CATAGCATTG 


CGACAAAAGA 


TGTGGATTCA 


TTTATCAGGA GGAAGGATGA 


TAAGTGGACT 


12300 


TGAAACGCCT 


GACCCATTAG 


AATTACTATC 


TGGGGTAGTA ATAACAGGAT 


CAGAACATTG 


12360 


TAAAATATGT 


TATTCTTCAG 


ATGGCACAAA 


CCCATATACT TGGATGTATT 


TACCCGGTAA 


12420 


TATCAAAATA 


GGATCAGCAG 


AAACAGGTAT 


ATCGTCATTA AGAGTTCCTT 


ATTTTGGATC 


12480 


AGTCACTGAT 


GAAAGATCTG 


AAGCACAATT 


AGGATATATC AAGAATCTTA 


GTAAACCTGC 


12540 


AAAA6CCGCA 


ATAAGAATAG 


CAATGATATA 


TACATGGGCA TTTGGTAATG 


ATGAGATATC 


12600 


TTGGATGGAA GCCTCACAGA TAGCACAAAC 


ACGTGCAAAT TTTACACTAG 


ATAGTCTCAA 


12660 


AATTTTAACA 


CCGGTAGCTA 


CATCAACAAA 


TTTATCACAC AGATTAAAGG 


ATACTGCAAC 


12720 
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TCA6AT6AAA 


TTCTCCAGTA 


CATCATTGAT 


CAGAGTCAGC AGATTCATAA 


CAATGTCCAA 


X2780 


TGATAACATG 


TCTATCAAAG 


AAGCTAATGA 


AACCAAAGAT ACTAATCTTA 


TTTATCAACA 


12840 


AATAATGTTA ACAGGATTAA 


GTGTTTTCGA 


ATATTTATTT 


AGATTAAAAG 


AAACCACAGG 


12900 


ACACAACCCT 


ATAGTTATGC 


ATCTGCACAT 


AGAAGATGAG 


TGTTGTATTA 


AAGAAAGTTT 


12960 


TAATGATGAA 


CATATTAATC 


CAGAGTCTAC 


ATTAGAATTA 


ATTCGATATC 


CTGAAAGTAA 


13020 


TGAATTTATT 


TATGATAAAG 


ACCCACTCAA 


AGATGT6GAC 


TTATCAAAAC 


TTATGGTTAT 


13080 


TAAAGACCAT 


TCTTACACAA 


TTGATATGAA 


TTATTG66AT 


GATACTGACA 


TCATACATGC 


13140 


AATTTCAATA 


TGTACTGCAA 


TTACAATAGC 


AGATACTATG 


TCACAATTAG 


ATCGAGATAA 


13200 


TTTAAAAGAG 


ATAATAGTTA 


TTGCAAATGA 


TGATGATATT 


AATAGCTTAA 


TCACTGAATT 


13260 


TTTGACTCTT 


GACATACTTG 


TATTTCTCAA 


GACATTTGGT 


GGATTATTAG 


TAAATCAATT 


13320 


TGCATACACT 


CTTTATAGTC 


TAAAAATAGA 


AGGTAGGGAT 


CTCATTTGGG 


ATTATATAAT 


13380 


GAGAACACTG 


AGAGATACTT 


CCCATTCAAT 


ATTAAAAGTA 


TTATCTT^TG 


CATTATCTCA 


13440 


TCCTAAAGTA 


TTCAAGAGGT 


TCTGGGATTG 


TGGAGTTTTA AACCCTATTT 


ATGGTCCTAA 


13500 


TACTGCTAGT 


CAAGACCAGA 


TAAAACTTGC 


CCTATCTATA 


TGTGAATATT 


CACTAGATCT 


13560 


ATTTAT6AGA 


GAATGGTTGA 


ATGGTGTATC 


ACTTGA7UVTA 


TACATTTGTG 


ACAGCGATAT 


13620 


GGAAGTTGCA 


AATGATAGGA 


AACAAGCCTT 


TATTTCTAGA 


CACCTTTCAT 


TTGTTTGTTG 


13680 


TTTAGCAGAA ATTGCATCTT 


TCGGACCTAA 


CCTGTTAAAC 


TTAACATACT 


TGGAGAGACT 


13740 


TGATCTATTG 


AAACAATATC 


TTGAATTAAA 


TATTAAAGAA 


GACCCTACTC 


TTAAATATGT 


13800 


ACAAATATCT 


6GATTATTAA 


TTAAATCGTT 


CCCATCAACT 


GTAACATAC6 


TAAGAAAGAC 


13860 


TGCAATCAAA 


TATCTAAGGA 


TTCGCGGTAT 


TAGTCCACCT 


GA6GTAATTG 


ATGATT6GGA 


13920 


TCCGGTAGAA 


GATGAAAATA 


TGCTGGATAA 


CATTGTCAAA 


ACTATAAATG 


ATAACTGTAA 


13980 


TAAAGATAAT 


AAAGGGAATA 


AAATTAACAA 


TTTCTGGGGA 


CTAGCACTTA 


AGAACTATCA 


14040 


AGTCCTTAAA 


ATCAGATCTA 


TAACAAGTGA 


TTCTGATGAT 


AATGATAGAC 


TAGATGCTAA 


14100 


TACAAGTGGT 


TTGACACTTC 


CTCAAGGAG6 


GAATTATCTA 


TCGCATCAAT 


TGAGATTATT 


14160 


CGGAATCAAC 


AGCACTAGTT 


GTCTGAAAGC 


TCTTGAGTTA 


TCACAAATTT 


TAATGAAGGA 


14220 


AGTCAATAAA GACAAGGACA 


GGCTCTTCCT 


GGGAGAAGGA 


GCAGGAGCTA 


TGCTAGCATG 


14280 


TTATGATGCC 


ACATTAGGAC 


CTGCAGTTAA 


TTATTATAAT 


TCAGGTTTGA 


ATATAACAGA 


14340 


TGTAATTGGT 


CAACGAGAAT 


TGAAAATATT 


TCCTTCAGAG 


GTATCATTAG 


TAGGTAAAAA 


14400 


ATTAGGAAAT 


GTGACACAGA 


TTCTTAACAG 


GGTAAAAGTA 


CTGTTCAATG 


GGAATCCTAA 


14460 


TTCAACATGG 


ATAGGAAATA 


TGGAATGTGA 


GAGCTTAATA 


TGGAGTGAAT 


TAAATGATAA 


14520 


GTCCATTGGA 


TTAGTACATT 


GTGATATGGA 


AGGAGCTATC 


GGTAAATCAG 


AAGAAACTGT 


14580 


TCTACATGAA 


CATTATAGTG 


TTATAAGAAT 


TACATACTTG 


ATTGGGGATG 


ATGATGTTGT 


14640 


TTTAGTTTCC 


A7UVATTATAC 


CTACAATCAC 


TCCGAATTGG 


TCTAGAATAC 


TTTATCTATA 


14700 


TAAATTATAT 


TGGAAAGATG 


TAAGTATAAT 


ATCACTCAAA 


ACTTCTAATC 


CTGCATCAAC 


14760 


AGAATTATAT 


CTAATTTCGA 


AAGATGCATA 


TTGTACTATA 


ATGGAACCTA 


GTGAAATTGT 


14820 
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TTTATCAAAA 


CTTAAAA6AT 


TGTCACTCTT 


GGAAGAAAAT 


AATCTATTAA 


AATGGATCAT 


14880 


TTTATCAAAG 


AAGAGGAATA 


ATGAATGGTT 


ACATCATGAA 


ATCAAAGAAG 


GAGAAAGAGA 


14940 


TTATGGAATC 


ATGAGACCAT 


ATCATATGGC 


ACTACAAATC 


TTTGGATTTC 


AAATCAATTT 


15000 


AAATCATCTG 


GCGAAAGAAT 


TTTTATCAAC 


CCCAGATCTG 


ACTAATATCA 


ACAATATAAT 


15060 


CCAAAGTTTT 


CAGCGAACAA 


TAAAGGATGT 


TTTATTTGAA 


TGGATTAATA 


TAACTCATGA 


15120 


TGATAAGAGA 


CATAAATTAG 


GCGGAAGATA 


TAACATATTC 


CCACTGAAAA 


ATAAGGGAAA 


15180 


GTTAAGACTG 


CTATCGAGAA 


GACTAGTATT 


AAGTTGGATT 


TCATTATCAT 


TATCGACTCG 


15240 


ATTACTTACA 


GGTCGCTTTC 


ctgatga;^ 


ATTTGAACAT AGAGCACAGA 


CTGGATATGT 


15300 


ATCATTAGCT 


GATACTGATT 


TAGAATCATT 


AAAGTTATTG 


TCGAAAAACA 


TCATTAAGAA 


15360 


TTACAGAGAG 


TGTATAGGAT 


CAATATCATA 


TTGGTTTCTA 


ACCAAAGAAG 


TTAAAATACT 


15420 


TATGAAATTG 


ATCGGTGGTG 


CTAAATTATT 


AGGAATTCCC 


AGACAATATA 


AAGAACCCGA 


15480 


AGACCAGTTA 


TTAGAAAACT 


ACAATCAACA 


TGATGAATTT 


GATATCGATT 


AAAACATA7UV 


15540 


TACAATGAAG 


ATATATCCTA 


ACCTTTATCT 


TTAAGCCTAG 


GAATAGACAA 


AAAGTAAGAA 


15600 


AAACATGTAA 


TATATATATA 


CCAAACAGAG 


TTGTTCTCTT 


GTTTGGTTAT 


AGTGAGTCGT 


15660 


ATTACAATC 












15669 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AATACGACTC ACTATAACCA T^CAAGAGAA C 31 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCAAGTACTA TGAGATGCTT CATT 24 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 



CCCTATAATT TCAACATGTT GAGCCTATTT G 



31 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQXTCNCE DESCRIPTION: SEQ ID NO: 5: 
GATTAAAATG TTGGTCGACT TAGTTGCTTC C 31 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCATAGAGAG TCCATGGAAA GC6ACGCTAA AAACTATC 38 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGGTGTCGTT TCTTTGTCGA CTCATTGGCA ATTGTTG 37 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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GCAAAGCGTG CCCGGGCCAT GGACACTGAA TCTAACAATG GC 



42 



(2) INF0RM21TI0N FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAAATTCCTT AATCGATTCT CTAGATTC 28 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCCATCAACT GTAACATACG TAAGAAAGAC 30 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGTTAGGATA TGTCGACATT GTATTTATG 29 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGGGTTATGC TACTGCAGGC TTTTTTTCTC CCTTAGCCAT CCG 43 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTCCATTCTA GANTTATAAA AATTATAGAG TTCCC 35 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TAATACGACT CACTATAACC AAACAAGAGA AGAAACTTGT CTGGGAATAT AAATTTAACT 60 

TTAAATTAAC TTAGGATTAA AGACATTGAC TAGAAGGTCA AGAAAAGGGA ACTCTATAAT 120 

TTCAAAAATG TTGAGCCTAT TTGATACATT TAATGCACGT AGGCAAGAAA ACATAACAAA 180 

ATCAGCCGGT GGAGCTATCA TTCCTGGACA GAAAAATACT GTCTCTATAT TCGCCCTTGG 240 

ACCGACAATA ACTGATGATA ATGAGAAAAT GACATTAGCT CTTCTATTTC TATCTCATTC 300 

ACTAGATAAT GAGAAACAAC ATGCACAAAG GGCAGGGTTC TTGGTGTCTT TATTGTCAAT 360 

GGCTTATGCC AATCCAGAGC TCTACCTAAC AACAAATGGA AGTAATGCAG ATGTCAAGTA 420 

TGTCATATAC ATGATTGAGA AAGATCTAAA ACGGCAAAAG TATGGAGGAT TTGTGGTTAA 480 

GACGAGAGAG ATGATATATG AAAAGACAAC TGATTGGATA TTTGGAAGTG ACCTGGATTA 540 

TGATCAGGAA ACTATGTTGC AGAACGGCAG GAACAATTCA ACAATTGAAG ACCTTGTCCA 600 

CACATTTGGG TATCCATCAT GTTTAGGAGC TCTTATAATA CAGATCTGGA TAGTTCTGGT 660 

CAAAGCTATC ACTAGTATCT CAGGGTTAAG AAAAGGCTTT TTCACCCGAT TGGAAGCTTT 720 

CAGACAAGAT GGAACAGTGC AGGCAGGGCT GGTATTGAGC GGTGACACAG TGGATCAGAT 780 

TGGGTCAATC ATGCGGTCTC AACAGAGCTT GGTAACTCTT ATGGTTGAAA CATTAATAAC 840 

AATGAATACC AGCAGAAATG ACCTCACAAC CATAGAAAAG AATATACAAA TTGTTGGCAA 900 

CTACATAAGA GATGCAGGTC TCGCTTCATT CTTCAATACA ATCAGATATG GAATTGAGAC 960 

CAGAATGGCA GCTTTGACTC TATCCACTCT CAGACCAGAT ATCAATAGAT TAAAAGCTTT 1020 

GATGGAACTG TATTTATCAA AGGGACCACG CGCTCCTTTC ATCTGTATCC TCAGAGATCC 1080 

TATACATGGT GAGTTCGCAC CAGGCAACTA TCCTGCCATA TGGAGCTATG CAATGGGGGT 1140 

GGCAGTTGTA CAAAATAGAG CCATGCAACA GTATGTGACG GGAAGATCAT ATCTAGACAT 1200 

TGATATGTTC CAGCTAGGAC AAGCAGTAGC ACGTGATGCC GAAGCTCAAA TGAGCTCAAC 1260 
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ACTGGAAGAT 


GAACTTGGAG 


TGACACACGA 


ATCTAAAGAA 


AGCTTGAAGA 


GACATATAAG 


1320 


GAACATAAAC 


AGTTCAGAGA 


CATCTTTCCA 


CAAACCGACA 


GGTGGATCAG 


CCATAGAGAT 


1380 


GGCAATAGAT 


GAAGAGCCAG 


AACAATTCGA 


ACATAGAGCA 


GATCAAGAAC 


AAAATGGAGA 


1440 


ACCTCAATCA TCCATAATTC 


AATATGCCTG 


GGCAGAAGGA AATAGAAGCG 


ATGATCAGAC 


1500 


TGAGCAAGCT 


ACAGAATCTG 


ACAATATCAA 


GACCGAACAA 


CAAAACATCA 


GAGACAGACT 


1560 


AAACAAGAGA 


CTCAACGACA 


AGAAGAAACA 


AAGCAGTCAA 


CCACCCACTA ATCCCACAAA 


1620 


CAGAACAAAC 


CAGGACGAAA 


TAGATGATCT 


GTTTAACGCA 


TTTGGAAGCA 


ACTAATCGAA 


1680 


TCAACATTTT 


AATCTAAATC 


AATAATAAAT 


AAGAAAAACT 


TAGGATTAAA 


GAATCCTATC 


1740 


ATACCGGAAT 


ATAGGGTGGT 


AAATTTAGAG 


TCTGCTTG7VA 


ACTCAATCAA 


TAGAGAGTTG 


1800 


ATGGAAAGCG 


ATGCTAAAAA 


CTATCAAATC 


ATGGATTCTT 


GGGAAGAGGA 


ATCAAGAGAT 


1860 


AAATCAACTA 


ATATCTCCTC 


GGCCCTCAAC 


ATCATTGAAT 


TCATACTCAG 


CACCGACCCC 


1920 


CAAGAAGACT 


TATCGGAAAA 


CGACACAATC 


AACACAAGAA 


CCCAGCAACT 


CAGTGCCACC 


1980 


ATCTGTCAAC 


CAGAAATCAA ACCAACAGAA ACAAGTGAGA AAGATAGTGG ATCAACTGAC 


2040 


AAAAATAGAC 


AGTCTGGGTC 


ATCACACGAA 


TGTACAACAG 


AAGCAAAAGA 


TAGAAATATT 


2100 


GATCA6GAAA 


CTGTACAGAG 


AGGACCTGGG 


AGAAGAAGCA 


GCTCAGATAG 


TAGAGCTGAG 


2160 


ACTGTGGTCT 


CTGGAGGAAT 


CCCCAGAAGC 


ATCACAGATT 


CTAAAAATGG 


AACCCAAAAC 


2220 


ACGGAGGATA 


TTGATCTCAA 


TGAAATTAGA 


AAGATGGATA 


AGGACTCTAT 


TGAGGGGAAA 


2280 


ATGCGACAAT 


CTGCAAATGT 


TCCAAGCGAG 


ATATCAGGAA 


GTGATGACAT 


ATTTACAACA 


2340 


GAACAAAGTA 


GAAACAGTGA 


TCATGGAAGA 


AGCCTGGAAT 


CTATCAGTAC 


ACCTGATACA 


2400 


AGATCAATAA 


GTGTTGTTAC 


TGCTGCAACA 


CCAGATGATG 


AAGAAGAAAT 


ACTAATGAAA 


2460 


AATAGTAGGA 


CAAAGAAAAG 


TTCTTCAACA 


CATCAAGAAG 


ATGACAAAAG 


AATTAAAAAA 


2520 


GGGGGAAAAG 


GGAAAGACTG 


GTTTAAGAAA 


TCAAAAGATA 


CCGACAACCA 


GATACCAACA 


2580 


TCAGACTACA 


GATCCACATC 


AAAAGGGCAG 


AAGAAAATCT 


CAAAGACAAC 


AACCACCAAC 


2640 


ACCGACACAA AGGGGCAAAC 


AGAAATACAG 


ACAGAATCAT 


CAGAAACACA ATCCTCATCA 


2700 


TGGAATCTCA 


TCATCGACAA 


CAACACCGAC 


CGGAACGAAC 


AGACAAGCAC 


AACTCCTCCA 


2760 


ACAACAACTT 


CCAGATCAAC 


TTATACAAAA 


GAATCGATCC 


GAACAAACTC 


TGAATCCAAA 


2820 


CCCAAGACAC 


AAAAGACAAA 


TGGAAAGGAA 


A6GAAGGATA 


CAGAAGAGAG 


CAATCGATTT 


2880 


ACAGAGAGGG 


CAATTACTCT 


ATTGCAGAAT 


CTTGGTGTAA 


TTCAATCCAC 


ATCAAAACTA 


2940 


GATTTATATC 


AAGACAAACG 


AGTTGTATGT 


GTAGCAAATG 


TACTAAACAA 


TGTAGATACT 


3000 


GCATCAT^GA 


TAGATTTCCT 


GGCAGGATTA 


GTCATAGGGG 


TTTCAATGGA 


CAACGACrACA 


3060 


AAATTAACAC 


AGATACAATU^ 


TGAAATGCTA AACCTCAAAG 


CAGATCTAAA 


GAAAATGGAC 


3120 


GTJVTCACATA GAAGATTGAT AGAAAATCAA AGAGAACAAC 


TGTCATTGAT 


CACGTCACTA 


3180 


ATTTCAAATC 


TCAAAATTAT 


GACTGAGAGA 


6GAG6AAAGA 


AAGACCAAAA 


TGAATCCAAT 


3240 


GAGAGAGTAT 


CCATGATCAA 


AACAAAATT6 


AAAGAAGAAA 


AGATCAAGAA 


GACCAGGTTT 


3300 


GACCCACTTA TGGAGGCACA AGGCATTGAC 


AA6AATATAC 


CCGATCTATA 


TCGACATGCA 


3360 
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GGAGATACAC 


TAGAGAACGA 


TGTACAAGTT 


AAATCAGAGA 


TATTAAGTTC 


ATACAATGAG 


3420 


TCAAATGCAA 


CAAGACTAAT 


ACCCAAAAAA 


GTGAGCAGTA 


CAATGAGATC 


ACTAGTTGCA 


3480 


GTCATCAACA 


ACAGCAATCT 


CTCACAAAGC 


ACAA7UVCAAT 


CATACATAAA 


CGAACTCAAA 


3540 


C6TTGCAAAA 


ATGATGAAGA 


AGTATCTGAA 


TTAATGGACA 


TGTTCAAT6A 


AGATGTOUVC 


3600 


AATTGCCAAT 


GATCCAACAA AGAAACGACA 


CCGAACAAAC 


AGACAAGA7VA 


CAACAGTAGA 


3660 


TCAAAACCTG 


TCAACACACA CAAAATCAAG 


CAGAATGAAA 


CAACAGATAT 


CAATCAATAT 


3720 


ACAAATAAGA 


AAAACTTAGG 


ATTATUVGAAT 


AAATTAATCC 


TTGTCCAAAA 


TGAGTATAAC 


3780 


TAACTCTGCA 


ATATACACAT 


TCCCAGAATC 


ATCATTCTCT 


GAAAATGGTC 


ATATAGAACC 


3840 


ATTACCACTC 


AAAGTCAATG 


AACAGAGGAA 


AGCAGTACCC 


CACATTAGAG 


TTGCCAAGAT 


3900 


CGGAAATCCA 


CCAAAACACG 


GATCCCGGTA 


TTTAGATGTC 


TTCTTACTCG 


GCTTCTTCGA 


3960 


GATGGAACGA ATCAAAGACA AATACGGGAG 


TGTGAATGAT 


CTCGACAGTG 


ACCCGAGTTA 


4020 


CAAAGTTTGT 


GGCTCTGGAT 


CATTACCAAT 


CGGATTGGCT 


AAGTACACTG 


GGAATGACCA 


4080 


GGAATTGTTA 


CT^GCCGCAA 


CCAAACTGGA 


TATAGAAGTG 


AGAAGAACAG 


TCAAAGCGAA 


4140 


AGAGATGGTT 


GTTTACACGG 


TACAAAATAT 


AAAACCAGAA 


CTGTACCCAT 


6GTCCAATAG 


4200 


ACTAAGAAAA 


GGAATGCTGT 


TCGATGCCAA 


CAAAGTTGCT 


CTTGCTCCTC 


AATGTCTTCC 


4260 


ACTAGATAGG 


AGCATAAAAT 


TTAGAGTAAT 


CTTCGTGAAT 


TGTACGGCAA 


TTGGATCAAT 


4320 


AACCTTGTTC 


AAAATTCCTA 


AGTCAATGGC 


ATCACTATCT 


CTACCCAACA 


CAATATCAAT 


4380 


CAATCTGCAG 


GTACACATAA AAACAGGGGT 


TCAGACTGAT 


TCTAAAGGGA 


TAGTTCAAAT 


4440 


TTTGGATGAG 


AAAGGCGAAA AATCACTGAA 


TTTCATGGTC 


CATCTCGGAT 


TGATCAAAAG 


4500 


AAAAGTAGGC 


AGAATGTACT 


CTGTTGAATA 


CTGTAAACAG 


AAAATCGAGA 


AAATGAGATT 


4560 


GATATTTTCT 


TTAGGACTAG 


TTGGAGGAAT 


CAGTCTTCAT 


GTCAATGCAA 


CTGGGTCCAT 


4620 


ATCAAAAACA 


CTAGCAAGTC 


AGCTGGTATT 


CAAAAGAGAG 


ATTTGTTATC 


CTTTAATGGA 


4680 


TCTAAATCCG 


CATCTCAATC 


TAGTTATCTG 


GGCTTCATCA 


GTAGAGATTA 


CAAGAGTGGA 


4740 


TGCAATTTTC 


CAACCTTCTT 


TACCTGGCGA 


GTTCAGATAC 


TATCCTAATA 


TTATTGCAAA 


4800 


AGGAGTTGGG 


AAAATCAAAC 


AATGGAACTA 


GTAATCTCTA 


TTTTAGTCCG 


GACGTATCTA 


4860 


TTAAGCCGAA 


GCAAATAAA6 


GATAATCAAA 


AACTTAGGAC 


AAAAGAGGTC 


AATACCAACA 


4920 


ACTATTAGCA 


GTCACACTCG 


CAAGAATAAG 


A6AGAAGGGA 


CCAAAAAAGT 


CAAATAGGAG 


4980 


AAATCAAAAC 


AAAAGGTACA 


GAACACCAGA 


ACAACAAAAT 


CAAAACATCC 


AACTCACTCA 


5040 


AAACAAAAAT 


TCCAAAAGAG 


ACCGGCAACA 


CAACAAGCAC 


TGAACACAAT GCCAACTTCA 


5100 


ATACTGCTAA 


TTATTACAAC 


CATGATCATG 


GCATCTTTCT 


GCCAAATAGA 


TATCACAAAA 


5160 


CTACAGCACG 


TAGGTGTATT 


GGTCAACAGT 


CCCAAAGGGA 


TGAAGATATC 


ACAAAACTTT 


5220 


GAAACAAGAT 


ATCTAATTTT 


GAGCCTCATA 


CCAAAAATAG 


AAGACTCTAA 


CTCTTGTGGT 


5280 


GACCAACAGA 


TCAAGCAATA 


CAAGAAGTTA 


TTGGATA6AC 


TGATCATCCC 


TTTATATGAT 


5340 


GGATTAAGAT 


TACAGAAAGA 


TGTGATAGTA 


ACCAATCAAG 


AATCCAATGA 


AAACACTGAT 


5400 


CCCAGAACAA 


AACGATTCTT 


TGGAGGGGTA ATTGGAACCA TTGCTCTGGG AGTAGCAACC 


5460 
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TCAGCACAAA 


TTACAGCGGC 


AGTTGCTCTG 


GTTGAAGCCA AGCAGGCAAG 


ATCAGACATC 


5520 


6AAAAACTCA 


AAGAA6CAAT 


TAGGGACACA 


AACAAAGCAG 


TGCAGTCAGT 


TCAGAGCTCC 


5580 


ATAGGAAATT 


TAATAGTAGC 


AATTAAATCA 


GTCCA6GATT 


ATGTTAACAA 


AGAAATCGTG 


5640 


CCATCGATTG 


CGAGGCTAGG 


TTGTGAAGCA 


GCAGGACTTC 


AATTA6GAAT 


TGCATTAACA 


5700 


CAGCATTACT 


CAGAATTAAC 


AAACATATTT 


GGTGATAACA 


TAGGATCGTT 


ACAAGAAAAA 


5760 


6GAATAAAAT 


TACAAGGTAT 


AGCATCATTA 


TACCGCACAA 


ATATCACAGA 


AATATTCACA 


5820 


ACATCAACAG 


TTGATAAATA 


TGATATCTAT 


GATCTGTTAT 


TTACAGAATC 


AATAAAGGTG 


5880 


AGAGTTATAG 


ATGTTGACTT 


GAATGATTAC 


TCAATCACCC 


TCCAAGTCAG 


ACTCCCTTTA 


5940 


TTAACTAGGC 


TGCTGAACAC 


TCAGATCTAG 


AAAGTAGATT 


CCATATCATA 


TAACATCCAA 


6000 


AACAGAGAAT 


GGTATATCCC 


TCTTCCCAGC 


CATATCATGA 


CGAAAGGGGC 


ATTTCTAGGT 


6060 


GGAGCAGACG 


TCAAAGAATG 


TATAGAAGCA 


TTCAGCAGCT 


ATATATGCCC 


TTCTGATCCA 


6120 


GGATTTGTAT 


TAAACCATGA 


AATAGAGAGC 


TGCTTATCAG 


GAAACATATC 


CCAATGTCCA 


6160 


AGAACAACGG 


TCACATCAGA 


CATTGTTCCA AGATATGCAT 


TTGTCAATGG 


AGGAGTGGTT 


6240 


GCAAACTGTA 


TAACAACCAC 


CTGTACATGC 


AACGGAATTG 


GTAATAGAAT 


CAATCAACCA 


6300 


CCTGATCAAG 


6AGTAAAAAT 


TATAACACAT 


AAAGAATGTA 


GTACAATAGG 


TATCAACGGA 


6360 


ATGCTGTTCA 


ATACAAATAA 


AGAAGGAACT 


CTTGCATTCT 


ATACACCAAA 


TGATATAACA 


6420 


CTAAACAATT 


CTGTTGCACT 


TGATCCAATT 


GACATATCAA 


TCGAGCTCAA 


CAAGGCCAAA 


6480 


TCAGATCTAG 


AAGAATCAAA 


AGAATGGATA AGAAGGTCAA 


ATCAAAAACT 


AGATTCTATT 


6540 


GGAAATTGGC 


ATC7VATCTAG 


CACTACAATC 


ATAATTATTT 


TGATAATGAT 


CATTATATTG 


6600 


TTTATAATTA 


ATATAACGAT 


AATTACAATT 


GCAATTAAGT 


ATTACAGAAT 


TCAAAAGAGA 


6660 


AATCGAGTGG 


ATCAAAATGA 


CAAGCCATAT 


GTACTAACAA 


ACAAATAACA 


TATCTACAGA 


6720 


TCATTAGATA 


TTAAAATTAT 


AAAAAACTTA 


GGAGTAAAGT 


TACGCAATCC 


AACTCTACTC 


6780 


ATATAATTGA 


GGAAGGACCC 


AATAGACAAA 


TCCAAATTCG 


AGATGGAATA 


CTGGAAGCAT 


6840 


ACCAATCACG 


GAAAGGATGC 


TGGTAATGAG 


CTGGAGACGT 


CTATGGCTAC 


TCATGGCAAC 


6900 


AAGCTCACTA ATAA6ATAAT 


ATACATATTA 


TGGACAATAA 


TCCTGGTGTT 


ATTATCAATA 


6960 


GTCTTCATCA 


TAGTGCTAAT 


TAATTCCATC 


AT^GTGAAA 


AGGCCCACGA ATCATTGCTG 


7020 


CAAGACATAA 


ATAATGAGTT 


TATGGAAATT 


ACAGAAAAGA 


TCCAAATGGC 


ATCGGATAAT 


7080 


ACCAATQATC 


TAATACAGTC 


AGGAGTGAAT 


ACAAGGCTTC 


TTACAATTCA 


GAGTCATGTC 


7140 


CAGAATTACA 


TACCAATATC 


ATTGACACAA 


CA6ATGTCA6 


ATCTTAGGAA 


ATTCATTAGT 


7200 


GAAATTACAA 


TTAGAAATGA 


TAATCAAGAA 


GTGCTGCCAC 


TU^GAATAAC 


ACATGATGTA 


7260 


GGTATAAAAC 


CTTTAAATCC 


AGATGATTTT 


TGGAGATGCA 


CGTCTGGTCT 


TCCATCTTTA 


7320 


ATGAATUICTC 


CAAAAATAAG 


GTTAATGCCA 


GGGCCGGGAT 


TATTAGCTAT 


GCCAACGACT 


7380 


GTTGATGGCT 


GTGTTAGAAC 


TCCGTCTTTA 


GTTATAAATG 


ATCTGATTTA 


TGCTTATACC 


7440 


TCAAATCTAA TTACTCGAGG 


TTGTCAGGAT 


ATAGGAAAAT 


CATATCAAGT 


CTTACAGATA 


7500 


GGGATAATAA 


CTGTAAACTC 


AGACTTGGTA 


CCTGACTTAA ATCCTAGGAT 


CTCTCATACC 


7560 



wo 98/53078 



PCT/US98/10551 



144 



TTTAACATAA ATGACAATAG 


GAAGTCATGT 


TCTCTAGCAC 


TCCTAAATAT 


AGATGTATAT 


7620 


CAACTGTGTT 


CAACTCCCAA 


AGTTGATGAA 


AGATCAGATT 


ATGCATCATC 


AGGCATAGAA 


7680 


GATATTGTAC 


TTGATATTGT 


CAATTATGAT 


GGTTCAATCT 


CAACAACAAG 


ATTTAAGAAT 


7740 


AATAACATAA GCTTTGATCA ACCATATGCT 


GCACTATACC 


CATCTGTTGG 


ACCAGGGATA 


7800 


TACTACAAAG 


GCAAAATAAT 


ATTTCTCGGG 


TATGGAGGTC 


TTGTU^CATCC 


AATAAATGAG 


7860 


AATGTAATCT 


GCAACACAAC 


TGGGTGCCCC 


GGGAAAACAC 


AGAGAGACTG 


TAATCAAGCA 


7920 


TCTCATAGTA 


CTTGGTTTTC 


AGATAGGAGG 


ATGGTCAACT 


CCATCATTGT 


TGTTGACAAA 


7980 


GGCTTAAACT 


CAATTCCAAA 


ATTGAAAGTA 


TGGACGATAT 


CTATGCGACA 


AAATTACTGG 


8040 


GGGTCAGAAG 


GAAGGTTACT 


TCTACTAGGT 


AACAAGATCT 


ATATATATAC 


AAGATCTACA 


8100 


A6TTGGCATA 


GCAAGTTACA 


ATTAGGAATA 


ATTGATATTA 


CTGATTACAG 


TGATATAAGG 


8160 


ATAAAATGGA 


CATGGCATAA 


TGTGCTATCA AGACCAGGAA 


ACAATGAATG 


TCCATGGGGA 


8220 


CATTCATGTC 


CAGATGGATG 


TATAACAGGA 


GTATATACTG 


ATGCATATCC 


ACTCAATCCC 


8280 


ACAGGGAGCA 


TTGTGTCATC 


TGTCATATTA GACTCACAAA 


AATCGAGAGT 


GAACCCAGTC 


8340 


ATAACTTACT 


CAACAGCAAC 


CGAAAGAGTA 


T^CGAGCTGG 


CCATCCTATU^ 


CAGAACACTC 


8400 


TCAGCTGGAT ATACAACAAC 


AAGCTGCATT ACACACTATA ACAAAGGATA 


TTGTTTTCAT 


8460 


ATAGTAGAAA 


TAAATCATAA 


AAGCTTAAAC 


ACATTTCAAC 


CCATGTTGTT 


CAAAACAGAG 


8520 


ATTCCAAAAA 


GCTGCAGTTA 


ATCATAATTA 


ACCATAATAT 


GCATCAATCT 


ATCTATAATA 


8580 


CAAGTATATG 


ATAAGTAATC 


AGCAATCAGA 


CAATAGACAA 


AAGGGAAATA 


TAAAAAACTT 


8640 


AGGAGCAAAG 


CGTGCTCGGG 


AAATGGACAC 


TGAATCTAAC 


AATGGCACT6 


TATCTGACAT 


8700 


ACTCTATCCT 


GAGTGTCACC 


TTAACTCTCC 


TATCGTTAAA 


GGTAAAATAG 


CACAATTACA 


8760 


CACTATTATG 


AGTCTTACCTC 


AGCCTTATGA 


TATGGATGAC 


GACTCAATAC 


TAGTTATCAC 


8820 


TAGACAGAAA ATAAAACTTA ATAAATTGGA 


TAAAAGACAA 


CGATCTATTA 


GAAGATTAAA 


8880 


ATTAATATTA 


ACTGAAAAAG 


TGAATGACTT 


AGGAAAATAC 


ACATTTATCA 


GATATCCAGA 


8940 


AATGTCAAAA 


GA7UVTGTTCA 


AATTATATAT 


ACCTGGTATT 


AACAGTAAAG 


TGACTGAATT 


9000 


ATTACTTAAA GCAGATAGAA 


CATATAGTCA 


AATGACTGAT 


GGATTAAGAG 


ATCTATGGAT 


9060 


TAATGTGCTA 


TCAAAATTAG 


CCTCAAAAAA 


TGATGGAAGC 


AATTATGATC 


TTAATGAAGA 


9120 


AATTAATAAT 


ATATCGAAAG 


TTCACACAAC 


CTATAAATCA 


GATAAATGGT 


ATAATCCATT 


9180 


CAAAACATG6 


TTTACTATCA 


AGTATGATAT 


GAGAAGATTA 


CAAAAAGCTC 


GAAATGAGAT 


9240 


CACTTTTAAT 


GTTGGGAAGG 


ATTATAACTT 


GTTAGAAGAC 


CAGAAGAATT 


TCTTATTGAT 


9300 


ACATCCAGAA 


TTGGTTTTGA 


TATTAGATAA 


ACAAAACTAT 


AATGGTTATC 


TAATTACTCC 


9360 


TGAATTAGTA 


TTGATGTATT 


GTGACGTAGT 


CGAAGGCCGA 


TGGAATATAA 


GTGCATGTGC 


9420 


TAAGTTAGAT 


CCAAAATTAC 


AATCTATGTA 


TCAGAAAGGT 


AATAACCTGT 


GGGAAGTGAT 


9480 


AGATAAATTG 


TTTCCAATTA 


TGGGAGAAAA 


GACATTTGAT 


GTGATATCGT 


TATTAG7ACC 


9540 


ACTTGCATTA 


TCCTTAATTC 


AAACTCATGA 


TCCTGTTAAA 


CAACTAAGAG 


GAGcrrrrrr 


9600 


AAATCATGTG 


TTATCCGAGA TGGAATTAAT 


ATTTGAATCT 


AGAGAATCGA 


TTAAGGAATT 


9660 



wo 98/53078 



PCTAJS98/10551 



145 



TCTGAGTGTA GATTACATTG 


ATAAAATTTT 


AGATATATTT 


AATAAGTCTA 


CAATAGATGA 


9720 


AATAGCAGA6 


ATTTTCTCTT 


TTTTTAGT^C 


ATTTGGGCAT 


CCTCCATTAG 


AAGCTAGTAT 


9780 


TGCAGCAGAA 


AAGGTTAGAA 


AATATATGTA 


TATTGGAAAA 


CAATTAAAAT 


TTGACACTAT 


9840 


TAATAAATGT 


CATGCTATCT 


TCTGTACAAT 


AATAATTAAC 


GGATATAGAG 


AGAGGCATGG 


9900 


TGGACAGTGG 


CCTCCTGTGA 


CATTACCTGA TCATGCACAC 


GAATTCATCA 


TAAATGCTTA 


9960 


CGGTTCAAAC 


TCTGCGATAT 


CATATGAAAA 


TGCTGTTGAT 


TATTACCAGA 


GCTTTATAGG 


10020 


AATAAAATTC 


AATAAATTCA 


TAGAGCCTCA 


GTTAGAT6AG 


GATTTGACAA 


TTTATATGAA 


10080 


AGATAAAGCA 


TTATCTCCAA 


AAAAATCAAA 


TTGGGACACA 


6TTTATCCTG 


CATCTAATTT 


10140 


ACTGTACCGT 


ACTAACGCAT 


CCAACGAATC 


ACGAAGATTA 


GTTGAAGTAT 


TTATAGCAGA 


10200 


TAGTAAATTT 


GATCCTCATC 


AGATATTGGA 


TTATGTAGAA 


TCTGGGGACT 


GGTTAGATGA 


10260 


TCCAGAATTT AATATTTCTT ATAGTCTTAA AGAAAAAGAG 


ATCAAACAGG 


AAGGTAGACT 


10320 


CTTTGCAAAA ATGACATACA AAATGAGAGC 


TACACAAGTT 


TTATCAGAGA 


CCCTACTTGC 


10380 


AAATAACATA 


GGAAAATTCT 


TTCAAGAAAA 


TGGGATGGTG 


AAGGGAGAGA 


TTGAATTACT 


10440 


TAAGAGATTA 


ACAACCATAT 


CAATATCAGG 


AGTTCCACGG 


TATAATGAAG 


TGTACAATAA 


10500 


TTCTAAAAGC 


CATACAGATG 


ACCTTAAAAC 


CTACAATAAA 


ATAAGTAATC 


TTAATTTGTC 


10560 


TTCTAATCAG 


AAATCAAAGA 


AATTTGAATT 


CAAGTCAACG 


GATATCTACA 


ATGATGGATA 


10620 


CGAGACTGTG 


AGCTGTTTCC 


TAACAACAGA 


TCTCAAAAAA 


TACTGTCTTA 


ATTGGAGATA 


10680 


TGAATCAACA 


GCTCTATTTG 


GAGAAACTTG 


CAACCAAATA 


TTTGGATTAA 


ATAAATTGTT 


10740 


TAATTGGTTA 


CACCCTCGTC 


TT6AAGGAAG 


TACAATCTAT 


GTAGGTGATC 


CTTACTGTCC 


10800 


TCCATCAGAT 


AAAGAACATA 


TATCATTAGA 


GGATCACCCT 


GATTCTGGTT 


TTTACGTTCA 


10860 


TAACCCAAGA 


GGGGGTATAG 


AAGGATTTTG 


TCAATUUITTA 


TGGACACTCA 


TATCTATAAG 


10920 


TGCAATACAT 


CTAGCAGCTG 


TTAGAATAGG 


CGTGAGGGTG 


ACTGCAATGG 


TTCAAGGAGA 


10980 


CAATCAAGCT 


ATAGCTGTAA 


CCACAAGAGT 


ACCCAAC/UVT 


TATGACTACA 


GAGTTAAGAA 


11040 


GGAGATAGTT 


TATAAAGATG 


TAGTGAGATT 


TTTTGATTCA 


TTAAGAGAAG 


TGATGGATGA 


11100 


TCTAGGTCAT 


GAACTTAAAT 


TAAATGAAAC 


GATTATAAGT 


AGCAAGATGT 


TCATATATAG 


11160 


CAAAAGAATC 


TATTATGATG 


GGAGAATTCT 


TCCTCAAGCT 


CTAAAAGCAT 


TATCTAGATG 


11220 


TGTCTTCTGG 


TCAGAGACAG 


TAATAGACGA AACAAGATCA 


GCATCTTCAA 


ATTTGGCAAC 


11280 


ATCATTT6CA 


AAAGCAATTG 


AGAATGGTTA 


TTCACCTGTT 


CTAGGATATG 


CATGCTCAAT 


11340 


TTTTAAGAAT 


ATTCAACAAC 


TATATATTGC 


CCTTGGGATG AATATCAATC CAACTATAAC 


11400 


ACAGAATATC 


AGAGATCAGT 


ATTTTAGGAA 


TCCAAATTGG 


ATGCAATATG 


CCTCTTTAAT 


11460 


ACCTGCTAGT 


GTTGGGGGAT 


TCAATTACAT 


GGCCATGTCA 


AGATGTTTTG 


TAAGGAATAT 


11520 


TGGTGATCCA 


TCAGTTGCCG 


CATTGGCTGA 


TATTAAAAGA 


TTTATTAAGG 


CGAATCTATT 


11580 


AGACCGAAGT 


GTTCTTTATA 


GGATTATGAA 


TCAAGAACCA 


GGTGAGTCAT 


CTTTTTTGGA 


11640 


CTGGGCTTCA 


GATCCATATT 


CATGCAATTT 


ACCACAATCT 


CAAAATATAA 


CCACCy^TGAT 


11700 


AAAAAATATA ACAGCAAGGA 


ATGTATTACA AGATTCACCA AATCCATTAT 


TATCTGGATT 


11760 
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ATTCACAAAT ACAATGATAG AAGAAGATGA AGAATTAGCT GAGTTCCTGA TGGACAGGAA 
GGTAATTCTC CCTAGAGTTG CACATGATAT TCTAGATAAT TCTCTCACAG GAATTAGAAA 
TGCCATAGCT GGAATGTTAG ATACGACAAA ATCACTAATT CGGGTTGGCA TAAATAGAGG 
AGGACTGACA TATAGTTTGT TGAGGAAAAT CAGTAATTAC GATCTAGTAC AATATGAAAC 
ACTAAGTAGG ACTTTGCGAC TAATTGTAAG TGATAAAATC AAGTATGAAG ATATGTGTTC 
GGTAGACCTT GCCATAGCAT TGCGACAAAA GATGTGGATT CATTTATCAG GAGGAAGGAT 
GATAAGTGGA CTTGAAACGC CTGACCCATT AGAATTACTA TCTGGGGTAG TAATAACAGG 
ATCAGAACAT TGTATUVATAT GTTATTCTTC AGATGGCACA AACCCATATA CTTGGATGTA 
TTTACCCGGT AATATCAAAA TAGGATCAGC AGAAACAGGT ATATCGTCAT TAAGAGTTCC 
TTATTTTGGA TCAGTCACTG ATGAAAGATC TGAAGCACAA TTAGGATATA TCAAGAATCT 
TAGTAAACCT GCAAAAGCCG CAATAAGAAT AGCAATGATA TATACATGGG CATTTGGTAA 
TGATGAGATA TCTTG6ATGG AAGCCTCACA GATAGCACAA ACACGTGCAA ATTTTACACT 
AGATAGTCTC AAAATTTTAA CACCGGTAGC TACATCAACA AATTTATCAC ACAGATTAAA 
GGATACTGCA ACTCAGATGA T^TTCTCCAG TACATCATTG ATCAGAGTCA GCAGATTCAT 
AACAATGTCC AATGATAACA TGTCTATCAA AGAAGCTAAT GAAACCAAAG ATACTAATCT 
TATTTATCAA CAAATAATGT TAACAGGATT AAGTGTTTTC GAATATTTAT TTAGATTAAA 
AGAAACCACA GGACACAACC CTATAGTTAT GCATCTGCAC ATAGAAGATG AGTGTTGTAT 
TAAAGAAAGT TTTAATGATG AACATATTAA TCCAGAGTCT ACATTAGAAT TAATTCGATA 
TCCTGAAAGT AATGAATTTA TTTAT6ATAA AGACCCACTC AAAGATGTGG ACTTATCAAA 
ACTTATGGTT ATTAAAGACC ATTCTTACAC AATTGATATG AATTATTGGG ATGATACTGA 
CATCATACAT GCAATTTCAA TATGTACTGC AATTACAATA GCAGATACTA TGTCACAATT 
AGATCGAGAT T^TTTAAAAG AGATAATAGT TATTGCAAAT GATGATGATA TTAATAGCTT 
AATCACTGAA TTTTTGACTC TTGACATACT TGTATTTCTC AAGACATTTG GTGGATTATT 
AGTAAATCAA TTTGCATACA CTCTTTATAG TCTAAAAATA 6AAGGTAGGG ATCTCATTTG 
GGATTATATA ATGAGAACAC TGAGAGATAC TTCCCATTCA ATATTAAAAG TATTATCTAA 
TGCATTATCT CATCCTAAAG TATTCAAGAG GTTCTGGGAT TGTGGAGTTT TAflACCCTAT 
TTATGGTCCT AATACTGCTA GTCAAGACCA GATAAAACTT GCCCTATCTA TATGTGAATA 
TTCACTAGAT CTATTTATGA GAGAATGGTT GAATGGTGTA TCACTTGAAA TATACATTTG 
TGACAGCGAT ATGGAAGTTG CAAATGATAG GAAACAAGCC TTTATTTCTA GACACCTTTC 
ATTTGTTTGT TGTTTAGCAG AAATTGCATC TTTCGGACCT AACCTGTTAA ACTTAACATA 
CTTGGAGAGA CTTGATCTAT TGAAACAATA TCTTGAATTA AATATTAAAG AAGACCCTAC 
TCTTAAATAT GTACAAATAT CTGGATTATT AATTAAATCG TTCCCATCAA CTGTAACATA 
CGTAAGAAAG ACTGCAATCA AATATCTAAG GATTCGCGGT ATTAGTCCAC CT6AGGTAAT 
TGATGATTGG GATCCGGTAG 7VAGATGAAAA TATGCTGGAT AACATTGTCA AAACTATAAA 
TGATAACTGT AATAAAGATA ATAAAGGGAA TAAAATTAAC AATTTCTGGG GACTAGCACT 
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TAAGAACTAT 


CAAGTCCTTA AAATCAGATC 


TATAACAAGT 


GATTCTGATG 


ATAATGATAG 


13920 


ACTAGATGCT 


AATACAAGTG 


GTTTGACACT 


TCCTC7UVGGA 


GGGAATTATC 


TATCGCATCA 


13980 


ATTGAGATTA 


TTCGGAATCA 


ACAGCACTAG 


TTGTCTGAAA 


GCTCTTGAGT 


TATCACAAAT 


14040 


TTTAATGAAG 


GAAGTCAATA 


AAGACAAGGA 


CAGGCTCTTC 


CTGG6AGAAG 


GAGCAGGAGC 


14100 


TATGCTAGCA 


TGTTATGATG 


CCACATTAGG 


ACCTGCAGTT 


AATTATTATA 


ATTCAGGTTT 


14160 


GAATATAACA 


GATGTAATTG 


GTCAACGAGA 


ATTGAAAATA 


TTTCCTTCAG 


AGGTATCATT 


14220 


AGTAGGTAAA 


AAATTAGGAA 


ATGTGACACA 


GATTCTTAAC 


AGGGTAAAAG 


TACTGTTCAA 


14280 


TGG6AATCCT 


AATTCAACAT 


GGATAGGAAA 


TATGGAATGT 


GAGAGCTTAA 


TATGGAGTGA 


14340 


ATTAAATGAT 


AAGTCCATTG 


GATTAGTACA 


TTGTGATATG 


GAAGGAGCTA 


TCGGTAAATC 


14400 


AGAAGAAACT 


GTTCTACATG 


AACATTATAG 


TGTTATAAGA 


ATTACATACT 


TGATTGGGGA 


14460 


TGATGATGTT 


GTTTTAGTTT 


CCAAAATTAT 


ACCTACAATC 


ACTCCGAATT 


GGTCTAGAAT 


14520 


ACTTTATCTA 


TATAAATTAT 


ATTGGAAAGA 


TGTAAGTATA 


ATATCACTCA AAACTTCTAA 


14580 


TCCTGCATCA 


ACAGAATTAT 


ATCTAATTTC 


GAAAGATGCA 


TATTGTACTA 


TAATGGAACC 


14640 


TAGTGAAATT 


GTTTTATCAA 


AACTTAAAAG 


ATTGTCACTC 


TTGGAAGAAA 


ATAATCTATT 


14700 


AAAATGGATC 


ATTTTATCAA 


AGAAGAGGAA 


TAATGAATGG 


TTACATCATG 


AAATCAAAGA 


14760 


AGGAGAAAGA 


GATTATGGAA 


TCATGAGACC 


ATATCATATG 


GCACTACAAA 


TCTTTGGATT 


14820 


TCAAATCAAT 


TTAAATCATC 


TGGCGAAAGA 


ATTTTTATCA ACCCCAGATC 


TGACTAATAT 


14880 


CAACAATATA 


ATCCAAAGTT 


TTCAGCGAAC 


AATAAAGGAT 


GTTTTATTTG 


AATGGATTAA 


14940 


TATAACTCAT 


GATGATAAGA 


GACATAAATT 


AGGCGGAAGA 


TATAACATAT 


TCCCACTGAA 


15000 


AAATAAGG6A 


AAGTTAAGAC 


TGCTATCGAG 


AAGACTAGTA 


TTAAGTTGGA 


TTTCATTATC 


15060 


ATTATCGACT 


CGATTACTTA 


CAGGTCGCTT 


TCCTGATGAA 


AAATTTGAAC 


ATAGAGCACA 


15120 


GACTGGATAT 


GTATCATTAG 


CTGATACTGA 


TTTAGAATCA 


TTAAAGTTAT 


TGTCGAAAAA 


15180 


CATCATTAAG 


AATTACAGAG 


AGTGTATAGG 


ATCAATATCA 


TATTGGTTTC 


TAACCAAAGA 


15240 


AGTTAAAATA 


CTTATGAAAT 


TGATCGGTGG 


TGCTAAATTA 


TTAGGAATTC 


CCAGACAATA 


15300 


TAAAGAACCC 


6AAGACCAGT 


TATTAGAAAA 


CTACAATCAA 


CATGATGAAT 


TTGATATCGA 


15360 


TTAAAACATA 


AATACAATGA 


AGATATATCC 


TAACCTTTAT 


CTTTAAGCCT 


AGGAATAGAC 


15420 


AAAAAGTAAG 


AAAAACATGT 


AATATATATA 


TACCAAACAG 


AGTTCTTCTC 


TTGTTTGGTG 


15480 


GGTCGGCATG 


GCATCTCCAC 


CTCCTCGCGG 


TCCGGACCTG 


GGCATCCGAA 


GGAGGACGCA 


15540 


CGTCCACTCG 


GATGGCTAAG 


GGAGAGCCTG 


CAGTAGCATA ACCCCTTGGG 


GCCTCTAAAC 


15600 


GGGTCTTGAG 


GGGTTTTTTG 


CTGAAAGGAG 


GAACTATATA 


CGCGTCGACG 


GGCCCCGCGC 


15660 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



TAATACGACT 


CACTATAGGA 


CCAAACAAGA 


GAA6AAACTT 


GTCTGGGAAT 


ATAAATTTAA 


60 


CTTTAAATTA 


ACTTAGGATT 


AAAGACATTG 


ACTAGAAGGT 


CAAGAAAAGG 


GAACTCTATA 


120 


ATTTCAAAAA 


TGTTGAGCCT 


ATTTGATACA 


TTTAATGCAC 


GTAGGCAAGA 


AAACATAACA 


180 


AAATCAGCCG 


GTG6AGCTAT 


CATTCCTGGA 


CAGAAAAATA 


CTGTCTCTAT 


ATTCGCCCTT 


240 


6GACCGACAA 


TAACTGATGA 


TAATGAGAAA 


ATGACATTAG 


CTCTTCTATT 


TCTATCTCAT 


300 


TCACTAGATA ATGAGAAACA ACATGCACAA AGGGCA6G6T 


TCTTGGTGTC 


TTTATTGTCA 


360 


ATGGCTTATG 


CCAATCCAGA 


GCTCTACCTA ACAACAAATG 


GAAGTAATGC 


AGATGTCAAG 


420 


TATGTCATAT 


ACATGATTGA 


GAAAGATCTA 


AAACGGCAAA 


AGTATGGAGG 


ATTTGTGGTT 


480 


AAGACGAGAG 


AGATGATATA 


TGAT^GACA 


ACTGATTGGA 


TATTTGGAAG 


TGACCTGGAT 


540 


TATGATCAGG 


AAACTATGTT 


GCAGAACGGC 


AGGAACAATT 


CAACAATTGA 


AGACCTTGTC 


600 


CACACATTTG 


GGTATCCATC 


ATGTTTAGGA 


6CTCTTATAA 


TACAGATCTG 


GATAGTTCTG 


660 


GTCAAAGCTA 


TCACTAGTAT 


CTCAGGGTTA 


AGAAAAGGCT 


TTTTCACCCG 


ATTGGAAGCT 


720 


TTCAGACAAG 


ATGGAACAGT 


GCAGGCA6GG 


CTGGTATTGA 


GCGGTGACAC 


AGTGGATCAG 


780 


ATTGGGTCAA 


TCATGCGGTC 


TCAACAGAGC 


TTGGTAACTC 


TTATGGTTGA 


AACATTAATA 


840 


ACAATGAATA 


CCAGCAGAAA 


TGACCTCACA ACCATAGAAA 


AGAATATACA 


AATTGTTGGC 


900 


AACTACATAA 


GAGATGCAGG 


TCTCGCTTCA 


TTCTTCAATA 


CAATCAGATA 


TGGAATTGAG 


960 


ACCAGAATGG 


CAGCTTTGAC 


TCTATCCACT 


CTCAGACCAG 


ATATCAATAG 


ATTAAAAGCT 


1020 


TTGATGGAAC 


TGTATTTATC 


AAAGGGACCA 


CGCGCTCCTT 


TCATCTGTAT 


CCTCAGAGAT 


1080 


CCTATACATG 


GTGAGTTCGC 


ACCAGGCAAC 


TATCCTGCCA 


TATGGAGCTA 


TGCAATGGGG 


1140 


GTGGCAGTTG 


TACAAAATAG 


AGCCATGCAA 


CAGTATGTGA 


CGGGAAGATC 


ATATCTAGAC 


1200 


ATTGATATGT 


TCCAGCTAGG 


ACAAGCAGTA 


GCACGTGATG 


CCGAAGCTCA 


AATGAGCTCA 


1260 


ACACTGGAAG 


ATGAACTTGG 


AGTGACACAC 


GAATCTAAAG 


AAAGCTTGAA 


GAGACATATA 


1320 


AGGAACATAA 


ACAGTTCAGA 


GACATCTTTC 


CACAAACCGA 


CAGGTGGATC 


AGCCATAGAG 


1380 


ATGGCAATAG 


ATGAAGAGCC 


AGAACAATTC 


GAACATAGAG 


CAGATCAAGA 


ACAAAATGGA 


1440 


GAACCTCAAT 


CATCCATAAT 


TCAATATGCC 


TGGGCAGAAG 


GA/^TAGAAG 


CGATGATCAG 


1500 


ACTGAGCAAG 


CTACAGAATC 


TGACAATATC 


AAGACCGAAC 


AACAT^CAT 


CAGAGACAGA 


1560 


CTAAACAAGA 


GACTCAACGA 


CAAGAAGAAA 


CAAAGCAGTC 


AACCACCCAC 


TAATCCCACA 


1620 


AACAGAACAA 


ACCAGGACGA 


AATAGATGAT 


CTGTTTAACG 


CATTTGGAAG 


CAACTAATCG 


1680 


AATCAACATT 


TTAATCTAAA 


TCAATAATAA 


ATAAGAAAAA 


CTTAGGATTA 


AAGAATCCTA 


1740 


TCATACCGGA 


ATATAGGGTG 


GTAAATTTAG 


AGTCTGCTTG 


AAACTCAATC 


AATAGAGAGT 


1800 


TGATGGAAAG 


CGATGCTAAA AACTATCAAA TCAT6GATTC 


TTGG6AAGAG 


GAATCAAGAG 


1860 


ATAAATCAAC 


TAATATCTCC 


TCGGCCCTCA ACATGATTGA ATTCATACTC AGCACCGACC 


1920 
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CCCAAGAAGA 


CTTATCGGAA 


AACGACACAA 


TCAACACAAG AACCCAGCAA 


CTCAGTGCCA 


1980 


CCATCTGTCA 


ACCAGAAATC 


AAACCAACAG 


AAACAAGTGA GAAAGATAGT 


GGATCAACTG 


2040 


ACAAAAATAG 


ACAGTCTGGG 


TCATCACACG 


AATGTACAAC AGAAGCAAAA 


GATAGAAATA 


2100 


TTGATCAGGA 


AACTGTACAG 


AGAGGACCTG 


GGAQAAGAAG CAGCTCAGAT 


AGTAGAGCTG 


2160 


AGACTGTGGT CTCTGGAGGA ATCCCCAGAA GCATCACAGA TTCTAAAAAT GGAACCCAAA 


2220 


ACACGGAGGA 


TATTGATCTC 


AATGAAATTA 


GAAAGATGGA TAAGGACTCT 


ATTGAGGGGA 


2280 


AAATGCGACA 


ATCTGCAAAT 


GTTCCAAGCG 


AGATATCAGG AAGTGATGAC 


ATATTTACAA 


2340 


CAGAACAAAG 


TAGAAACA6T 


GATCATGGAA 


GAAGCCTGGA ATCTATCAGT 


ACACCTGATA 


2400 


CAAGATCAAT 


AAGTGTTGTT 


ACTGCTGCAA 


CACCAGATGA TGAAGAAGAA 


ATACTAATGA 


2460 


AAAATAGTAG 


GACAAAGAAA 


AGTTCTTCAA 


CACATCAAGA AGATGACAAA 


AGAATTAAAA 


2520 


AAGGGGGAAA AGGGAAAGAC 


TGGTTTAAGA 


AATCAAAAGA TACCGACT^C 


CAGATACCAA 


2580 


CATCAGACTA 


CAGATCCACA 


TCAAAAGGGC AGAAGAAAAT CTCAAAGACA ACAACCACCA 


2640 


ACACCGACAC 


AAAGGGGCAA 


ACAGAAATAC 


AGACAGAATC ATCAGAAACA 


CAATCCTCAT 


2700 


CATGGAATCT CATCATCGAC AACAACACCG ACCGGAACGA ACAGACAAGC ACAACTCCTC 


2760 


CAACAACAAC 


TTCCAGATCA 


ACTTATACAA AAGAATCGAT CCGAACAAAC 


TCTGAATCCA 


2820 


AACCCAAGAC 


ACAAAAGACA 


AATGGAAAGG 


AAAGGAAGGA TACAGAAGAG 


AGCAATCGAT 


2880 


TTACAGAGAG 


GGCAATTACT 


CTATTGCAGA ATCTTGGTGT AATTCAATCC 


ACATCAAAAC 


2940 


TAGATTTATA 


TCAAGACAAA 


CGAGTTGTAT 


GTGTAGCAAA TGTACTAAAC 


AATGTAGATA 


3000 


CTGCATCAAA 


GATAGATTTC 


CTGGCAGGAT 


TAGTCATAGG GGTTTCAATG 


GACAACGACA 


3060 


CAAAATTAAC 


ACAGATACAA 


AATGAAATGC 


TAAACCTCAA AGCAGATCTA 


AAGAAAATGG 


3120 


ACGAATCACA 


TAGAAGATTG 


ATAGAAAATC 


AAAGAGAACA ACTGTCATTG 


ATCACGTCAC 


3180 


TAATTTCAAA 


TCTCAAAATT 


ATGACTGAGA 


GAGGAGGAAA GAAAGACCAA 


AATGAATCCA 


3240 


ATGAGAGAGT 


ATCCATGATC 


AAAACAAAAT 


TGAAAGAAGA AAAGATCAAG 


AAGACCAGGT 


3300 


TTGACCCACT 


TATGGAGGCA 


CAAGGCATTG 


ACAAGAATAT ACCCGATCTA 


TATCGACATG 


3360 


CAGGAGATAC 


ACTAGAGAAC 


GATGTACAAG 


TTAAATCAGA GATATTAA6T 


TCATACAATG 


3420 


AGTCAAATGC 


AACAAGACTA 


ATACCCAT^ 


T^GTGAGCAG TACAATGAGA 


TCACTAGTTG 


3480 


CAGTCATCAA 


CAACA6CAAT 


CTCTCACAAA 


GCACAAAACA ATCATACATA 


AACGAACTCA 


3540 


AACGTTGCAA 


AAATGATGAA 


GAAGTATCTG 


AATT7UVTGGA CATGTTC7UVT 


GAA6ATGTCA 


3600 


ACAATTGCCA 


ATGATCCAAC 


TU^GAAACGA 


CACCGAACAA ACAGACAAGA 


AACAACAGTA 


3660 


GATCAAAACC 


TGTCAACACA 


CAC7VAAATCA AGCAGAATGA AACAACAGAT 


ATCAATCAAT 


3720 


ATACTVAATAA 


GATkAAACTTA 


GGATTAAA6A ATAAATTAAT CCTTGTCCAA 


AATGAGTATA 


3780 


ACTAACTCTG 


CAATATACAC 


ATTCCCAGAA 


TCATCATTCT CTGAAAATGG 


TCATATAGAA 


3840 


CCATTACCAC 


TCAAAGTCAA 


TGAACAGAGG 


A7UVGCAGTAC CCCACATTAG 


AGTTGCCAAG 


3900 


ATCGGAAATC 


CACCAAAACA 


CGGATCCCGG 


TATTTAGATG TCTTCTTACT 


CGGCTTCTTC 


3960 


GAGATGGAAC 


GAATCAAAGA 


CAAATACGGG 


AGTGTGAATG ATCTCGACAG 


TGACCCGAGT 


4020 
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TACAAAGTTT GTGGCTCTGG ATCATTACCA ATCGGATTGG CTAAGTACAC TGGGAAT6AC 
CAGGAATTGT TACAAGCCGC AACCAAACTG GATATAGAAG TGAGAAGAAC AGTCAAAGCG 
AAAGAGATGG TTGTTTACAC GGTACAJ\AAT ATAAAACCAG AACTGTACCC ATGGTCCAAT 
AGACTAAGAA AAGGAATGCT GTTCGATGCC AACAAAGTTG CTCTTGCTCC TCAATGTCTT 
CCACTAGATA GGAGCATAAA ATTTAGAGTA ATCTTCGTGA ATTGTACGGC AATTGGATCA 
ATAACCTTGT TCAAAATTCC TAAGTCAATG GCATCACTAT CTCTACCCAA CACAATATCA 
ATCAATCTGC AGGTACACAT AAAAACAGGG GTTCAGACTG ATTCTAAAGG GATAGTTCAA 
ATTTTGGATG AGAAAGGCGA AAAATCACTG AATTTCATGG TCCATCTCGG ATTGATCAAA 
AGAA7AGTAG GCAGAATGTA CTCTGTTGAA TACTGTAAAC AGAAAATCGA GAAAATGAGA 
TTGATATTTT CTTTAGGACT AGTTGGAGGA ATCAGTCTTC ATGTCAATGC AACTGGGTCC 
ATATCAAAAA CACTAGCAAG TCAGCTGGTA TTCAAAAGAG AGATTTGTTA TCCTTTAATG 
GATCTAAATC CGCATCTCAA TCTAGTTATC TGGGCTTCAT CAGTAGAGAT TACAAGAGTG 
GATGCAATTT TCCAACCTTC TTTACCTGGC GAGTTCA6AT ACTATCCTAA TATTATTGCA 
A7UVGGAGTTG GGAAAATCAA ACAATGGAAC TAGTAATCTC TATTTTAGTC CGGACGTATC 
TATTAAGCCG AAGCAAATAA AGGATAATCA AAAACTTAGG ACAAAAGAGG TCAATACCAA 

caactattag cagtcacact cgcaagaata agagagaagg gaccaaaaaa gtcaaatagg 
agaaatcaaa ACAAAAGGTA cagaacacca gaacaacaaa atcaaaacat ccaactcact 
caaaacaaaa attccaaaag agaccggcaa cacaacaagc actgaacaca atgccaactt 
caatactgct aattattaca accatgatca tggcatcttt ctgccaaata gatatcacaa 

AACTACAGCA CGTAGGTGTA TTGGTCAACA GTCCCAAAGG GATGAAGATA TCACAAAACT 
TTGAAACAAG ATATCTAATT TTGAGCCTCA TACCAAAAAT AGAAGACTCT AACTCTTGTG 
GTGACCAACA GATCAAGCAA TACAAGAAGT TATTGGATAG ACTGATCATC CCTTTATATG 
ATGGATTAAG ATTACAGAAA GATGTGATAG TAACCAATCA AGAATCCAAT GAAAACACTG 
ATCCCAGAAC AAAACGATTC TTTGGAGGGG TAATTGGAAC CATTGCTCTG GGAGTAGCAA 
CCTCAGCACA AATTACAGCG GCAGTTGCTC TGGTTGAAGC CAAGCAGGCA AGATCAGACA 
TCGAAAAACT CAAAGAAGCA ATTAGGGACA CAAACAAAGC AGTGCAGTCA GTTCAGAGCT 
CCTITAGGAAA TTTAATAGTA GCAATTAAAT CAGTCCAGGA TTATGTTAAC AAAGAAATCG 
TGCCATCGAT TGCGAGGCTA GGTTGTGAAG CAGCAGGACT TCAATTAGGA ATTGCATTAA 
CACAGCATTA CTCAGAATTA ACAAACATAT TTGGTGATAA CATAGGATCG TTACAAGAAA 
AAGGAATAAA ATTACAAGGT ATAGCATCAT TATACCGCAC AAATATCACA GAAATATTCyV 
CAACATCAAC AGTTGATAAA TATGATATCT ATGATCTGTT ATTTACAGAA TCAATAAAGG 
TGAGAGTTAT AGATGTTGAC TTGAATGATT ACTCAATCAC CCTCCAAGTC AGACTCCCTT 
TATTAACTAG GCTGCTGAAC ACTCAGATCT ACAAAGTAGA TTCCATATCA TATAACATCC 
AAAACAGAGA ATGGTATATC CCTCTTCCCA GCCATATCAT GACGAAAGGG GCATTTCTAG 
GTGGAGCAGA CGTCAAAGAA TGTATAGAAG CATTCAGCAG CTATATATGC CCTTCTGATC 



4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 




# 
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CAGGATTTGT ATTAAACCAT GAAATAGAGA GCTGCTTATC AGGAAACATA TCCCAATGTC 
CAAGAACAAC GGTCACATCA GACATTGTTC CAAGATATGC ATTTGTCAAT GGAGGAGTGG 
TTGCAAACTG TATAACAACC ACCTGTACAT GCAACGGAAT TGGTAATAGA ATCAATCTUVC 
CACCTGATCA AGGAGTAAAA ATTATAACAC ATAAAGAATG TAGTACAATA GGTATCAACG 
GAATGCTGTT CAATACAAAT AAAGAAGGAA CTCTTGCATT CTATACACCA AATGATATAA 
CACTAAACAA TTCTGTTGCA CTTGATCCAA TTGACATATC AATCGAGCTC AACAAGGCCA 
AATCAGATCT AGAAGAATCA AAAGAATGGA TAAGAAGGTC AAATCAAAAA CTAGATTCTA 
TTGGAAATTG GCATCAATCT AGCACTACAA TCATAATTAT TTTGATAATG ATCATTATAT 
TGTTTATAAT TAATATAACG ATAATTACAA TTGCAATTAA GTATTACAGA ATTCAAAAGA 
GAAATCGAGT GGATCAAAAT GACAAGCCAT ATGTACTAAC AAACAAATAA CATATCTACA 
GATCATTAGA TATTAAAATT ATA/^AAAACT TAGGAGTAAA GTTACGCAAT CCAACTCTAC 
TCATATAATT GAGGAAGGAC CCAATAGACA AATCCA/UITT CGAGATGGAA TACTGGAAGC 
ATACCAATCA CGGAAAGGAT GCTGGTAATG AGCTGGAGAC GTCTATGGCT ACTCATGGCA 
ACAAGCTCAC TAATAAGATA ATATACATAT TATGGACAAT AATCCTGGTG TTATTATCAA 
TAGTCTTCAT CATAGTGCTA ATTAATTCCA TCAAAAGTGA AAAGGCCCAC GAATCATTGC 
TGCAAGACAT AAATAATGAG TTTATGGAAA TTACAGAAAA GATCCAAATG GCATCGGATA 
ATACCAATGA TCTAATACAG TCAGGAGTGA ATACAAGGCT TCTTACAATT CAGAGTCATG 
TCCAGAATTA CATACCAATA TCATTGACAC AACAGATGTC AGATCTTAGG AAATTCATTA 
GTGAAATTAC AATTAGAAAT GATAATCAAG AAGTGCTGCC ACAAAGAATA ACACATGATG 
TAGGTATAAA ACCTTTAAAT CCAGATGATT TTTGGAGATG CACGTCTGGT CTTCCATCTT 
TAATGAAAAC TCCAAAAATA AGGTTAATGC CAGGGCCGGG ATTATTAGCT ATGCCAACGA 
CTGTTGATGG CTGTGTTAGA ACTCCGTCTT TAGTTATAAA TGATCTGATT TATGCTTATA 
CCTCAAATCT AATTACTCGA GGTTGTCAG6 ATATAGGAAA ATCATATCAA GTCTTACAGA 
TAGGGATAAT AACTGTAAAC TCAGACTTGG TACCTGACTT AAATCCTAGG ATCTCTCATA 
CCTTTAACAT AAATGACAAT AGGAAGTCAT GTTCTCTAGC ACTCCTAAAT ATAGATGTAT 
ATCAACTGTG TTCAACTCCC AAAGTTGATG AAAGATCAGA TTATGCATCA TCAGGCATAG 
AAGATATTGT ACTTGATATT GTCAATTATG ATGGTTCAAT CTCAACAACA AGATTTAAGA 
ATAATAACAT AAGCTTTGAT CAACCATATG CTGCACTATA CCCATCTGTT GGACCAGGGA 
TATACTACAA AGGCAAAATA ATATTTCTCG GGTATGGAGG TCTTGAACAT CCAATAAATG 
AGAATGTAAT CTGCMCACA ACTGGGTGCC CCGGGAAAAC ACAGAGAGAC TGTAATCAAG 
CATCTCATAG TACTTGGTTT TCAGATAGGA GGATGGTCAA CTCCATCATT GTTGTTGACA 
AAGGCTTAAA CTCAATTCCA AAATTGAAAG TATGGAC6AT ATCTATGCGA CAAAATTACT 
GGGGGTCAGA AGGAAGGTTA CTTCTACTAG GTAACAAGAT CTATATATAT ACAAGATCTA 
CAAGTTGGCA TAGCAAGTTA CAATTAGGAA TAATTGATAT TACTGATTAC AGTGATATAA 
GGATAAAATG GACATGGCAT AATGTGCTAT CAAGACCAGG AAACAATGAA TGTCCATGGG 



6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 
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GACATTCATG 


TCCAGATGGA 


TGTATAACAG 


GAGTATATAC 


TGATGCATAT 


CCACTCAATC 


8280 


CCACAGGGAG 


CATTGTGTCA 


TCTGTCATAT 


TAGACTCACA 


AAAATCGAGA 


GTGAACCCAG 


8340 


TCATAACTTA 


CTCAACAGCA 


ACCGAAAGAG 


TAAACGAGCT 


GGCCATCCTA 


AACAGAACAC 


8400 


TCTCAGCTGG 


ATATACAACA 


ACAAGCTGCA 


TTACACACTA 


TAACAAAGGA 


TATTGTTTTC 


8460 


ATATAGTAGA 


AATAAATCAT 


AAAAGCTTAA 


ACACATTTCA 


ACCCATGTTG 


TTCAAAACAG 


8520 


AGATTCCAAA 


AAGCTGCAGT 


TAATCATAAT 


TAACCATAAT 


ATGCATCAAT 


CTATCTATAA 


8580 


TACAAGTATA 


TGATAAGTAA 


TCAGCAATCA 


GACAATAGAC 


AAAAGGGAAA 


TATAAAAAAC 


8640 


TTAGGAGCAA 


AGCGTGCTCG 


GGAAATGGAC 


ACTGAATCTA 


ACAATGGCAC 


TGTATCTGAC 


8700 


ATACTCTATC 


CTGAGTGtCA 


CCTTAACTCT 


CCTATCGTTA 


AAGGTAAAAT 


AGCACAATTA 


8760 


CACACTATTA 


TGAGTCTACC 


TCAGCCTTAT 


GATATGGATG 


ACGACTCAAT 


ACTAGTTATC 


8820 


ACTAGACAGA 


AAATAAAACT 


TAATAAATTG 


GATAAAAGAC 


AACGATCTAT 


TAGAAGATTA 


8880 


AAATTAATAT 


TAACTGAAAA 


AGTGAATGAC 


TTAGGAAAAT 


ACACATTTAT 


CAGATATCCA 


8940 


GAAATGTCAA 


AAGAAATGTT 


CAAATTATAT 


ATACCTGGTA 


TTAACAGTAA 


AGTGACTGAA 


9000 


TTATTACTTA 


AAGCAGATAG 


AACATATAGT 


CAAATGACTG 


ATGGATTAAG 


AGATCTATGG 


9060 


ATTAATGTGC 


TATCAAAATT 


AGCCTCAAAA 


AATGATG6AA 


GCAATTATGA 


TCTTAATGAA 


9120 


GAAATTAATA 


ATATATCGAA 


AGTTCACACA 


ACCTATAAAT 


CAGATAAATG 


GTATAATCCA 


9180 


TTCAAAACAT 


GGTTTACTAT 


CAAGTATGAT 


ATGAGAAGAT 


TACAAAAAGC 


TCGAAATGAG 


9240 


ATCACTTTTA 


ATGTTGGGAA 


GGATTATAAC 


TTGTTAGAAG 


ACCAGAAGAA 


TTTCTTATTG 


9300 


ATACATCCAG 


AATTGGTTTT 


GATATTAGAT 


AAACAAAACT 


ATAATGGTTA 


TCTAATTACT 


9360 


CCTGAATTAG 


TATTGATGTA 


TTGTGACGTA 


GTCGAAGGCC 


GATGGAATAT 


AAGTGCATGT 


9420 


GCTAAGTTAG 


ATCCAAAATT 


ACAATCTATG 


TATCAGAAAG 


GTAATAACCT 


GTGGGAAGTG 


9480 


ATAGATA7VAT 


TGTTTCCAAT 


TATGGGAGAA 


AAGACATTTG 


ATGTGATATC 


GTTATTAGAA 


9540 


CCACTTGCAT 


TATCCTTAAT 


TCAAACTCAT 


GATCCTGTTA 


AACAACTAAG 


AGGAGCTTTT 


9600 


TTAAATCATG 


TGTTATCCGA 


GATGGAATTA 


ATATTTGAAT 


CTAGAGAATC 


GATTAAGGAA 


9660 


TTTCTGAGTG 


TAGATTACAT 


TGATAAAATT 


TTAGATATAT 


TTAATAAGTC 


TACAATAGAT 


9720 


GAAATAGCAG 


AGATTTTCTC 


TTTTTTTAGA 


ACATTTGGGC 


ATCCTCCATT 


AGAAGCTAGT 


9780 


ATTGCAGCAG 


AAAAGGTTAG 


AAAATATATG 


TATATTGGAA 


AACTUITTAAA 


ATTTQACACT 


9840 


ATTAATAAAT 


GTCATGCTAT 


CTTCTGTACA 


ATAATAATTA 


ACGGATATAG 


AGAGAGGCAT 


9900 


GGTGGACAGT 


GGCCTCCTGT 


GACATTACCT 


GATCATGCAC 


ACGAATTCAT 


CATAAATGCT 


9960 


TACGGTTCAA 


ACTCTGCGAT 


ATCATATGAA 


AATGCTGTTG 


ATTATTACCA 


GAGCTTTATA 


10020 


GGAATAAAAT 


TCAATAAATT 


CATAGAGCCT 


CAGTTAGATG 


AGGATTTGAC 


AATTTATATG 


10080 


AAAGATAAAG 


CATTATCTCC 


AAAAAAATCA 


AATTGGGACA 


CAGTTTATCC 


TGCATCTAAT 


10140 


TTACTGTACC 


GTACTAACGC 


ATCCAACGAA 


TCACGAAGAT 


TAGTTGAAGT 


ATTTATAGCA 


10200 


GATAGTAAAT 


TTGATCCTCA 


TCAGATATTG 


GATTATGTAG 


AATCTGGGGA 


CTGGTTAGAT 


10260 


GATCCAGAAT 


TTAATATTTC 


TTATAGTCTT 


AAAGAAAAAG 


AGATCAAACA 


GGAAGGTAGA 


10320 
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CTCTTTGCAA 


AAATGACATA 


CAAAATGAGA 


GCTACACAAG 


TTTTATCAGA 


GACCCTACTT 


10380 


GCAAATAACA 


TAGGAAAATT 


CTTTCAAGAA 


AATGGGATGG 


TGAAGGGAGA 


GATTG7VATTA 


10440 


CTTAAGAGAT 


TAACAACCAT 


ATCAATATCA 


GGAGTTCCAC 


GGTATAATGA 


AGTGTACAAT 


10500 


AATTCTAAAA 


GCCATACAGA 


TGACCTTAAA 


ACCTACAATA 


AAATAAGTAA 


TCTTAATTTG 


10560 


TCTTCTAATC 


AGAAATCAAA 


GAAATTTGAA 


TTCAAGTCAA 


CGGATATCTA 


CAATGATGGA 


10620 


TACGAGACTG 


TGAGCTGTTT 


CCTAACAACA 


GATCTCAAAA 


AATACTGTCT 


TAATTGGAGA 


10680 


TATGAATCAA 


CAGCTCTATT 


TGGAGAAACT 


TGCAACCAAA 


TATTTGGATT 


AAATAAATTG 


10740 


TTTAATTGGT 


TACACCCTCG 


TCTTGAAGGA 


AGTACAATCT 


ATGTAGGTGA 


TCCTTACTGT 


10800 


CCTCCATCAG 


ATAAAGAACA 


TATATCATTA 


GAGGATCACC 


CTGATTCTGG 


TTTTTACGTT 


10860 


CATAACCCAA 


GAGGGGGTAT 


AGAAGGATTT 


TGTCAAAAAT 


TATGGACACT 


CATATCTATA 


10920 


AGTGC7UVTAC 


ATCTAGCAGC 


TGTTAGAATA 


GGCGTGAGGG 


TGACTGCAAT 


GGTTCAAGGA 


10980 


GACAATCAAG 


CTATAGCTGT 


AACCACAAGA 


GTACCCAACA 


ATTATGACTA 


CAGAGTTAAG 


11040 


AAGGAGATAG 


TTTATT^GA 


TGTAGTGAGA 


TTTTTTGATT 


CATTAAGAGA 


AGTGATGGAT 


11100 


GATCTAGGTC 


ATGAACTTAA 


ATTAAATGAA 


ACGATTATAA 


GTAGCAAGAT 


GTTCATATAT 


11160 


AGCAAAAGAA 


TCTATTATGA 


TGGGAGAATT 


CTTCCTCAAG 


CTCTAAAAGC 


ATTATCTAGA 


11220 


TGTGTCTTCT 


GGTCAGAGAC 


AGTAATAGAC 


GAAACAAGAT 


CAGCATCTTC 


AAATTTGGCA 


11280 


ACATCATTTG 


CAAAAGCAAT 


TGAGAATGGT 


TATTCACCTG 


TTCTAGGATA 


TGCATGCTCA 


11340 


ATTTTTAAGA 


ATATTCAACA 


ACTATATATT 


GCCCTTGGGA 


TGAATATCAA 


TCCAACTATA 


11400 


ACACAGAATA 


TCAGAGATCA 


GTATTTTAGG 


AATCCAAATT 


GGATGCAATA 


TGCCTCTTTA 


11460 


ATACCTGCTA 


GTGTTGGGGG 


ATTCAATTAC 


ATGGCCATGT 


CAAGATGTTT 


TGTAAGGAAT 


11520 


ATTGGTGATC 


CATCAGTTGC 


CGCATTGGCT 


GATATTAAAA 


GATTTATTAA 


GGCGAATCTA 


11580 


TTAGACCGAA 


GTGTTCTTTA 


TAGGATTATG 


AATCAAGAAC 


CAGGTGAGTC 


ATCTTTTTTG 


11640 


GACTGGGCTT 


CAGATCCATA 


TTCATGCAAT 


TTACCACAAT 


CTCAAAATAT 


AACCACCATG 


11700 


ATAAAAAATA 


TAACAGCAAG 


GAATGTATTA 


CAAGATTCAC 


CAAATCCATT 


ATTATCTGGA 


11760 


TTATTCACAA 


ATACAATGAT 


AGAAGAAGAT 


GAAGAATTAG 


CTGAGTTCCT 


GATGGACAGG 


11B20 


AAGGTAATTC 


TCCCTAGAGT 


TGCACATGAT 


ATTCTAGATA 


ATTCTCTCAC 


AGGAATTAGA 


11880 


AATGCCATAG 


CTGGAATGTT 


AGATACGACA 


AAATCACTAA 


TTCGGGTTGG 


CATAAATAGA 


11940 


GGAGGACTGA 


CATATAGTTT 


GTTGAGGAAA 


ATCAGTAATT 


ACGATCTAGT 


ACAATATGAA 


12000 


ACACTAAGTA 


GGACTTTGCG 


ACTAATTGTA 


AGTGATAAAA 


TCAAGTATGA 


AGATATGTGT 


12060 


TCGGTAGACC 


TTGCCATAGC 


ATTGCGACAA 


AAGATGTGGA 


TTCATTTATC 


AG6AGGAAGG 


12120 


ATGATAAGTG 


GACTTGAAAC 


GCCTGACCCA 


TTAGAATTAC 


TATCTGGGGT 


AGTAATAACA 


12180 


GGATCAGAAC 


ATTGTAAAAT 


ATGTTATTCT 


TCAGAT6GCA 


CAAACCCATA 


TACTTGGATG 


12240 


TATTTACCCG 


GTAATATCAA 


AATAGGATCA 


GCAGAAACAG 


GTATATCGTC 


ATTAAGAGTT 


12300 


CCTTATTTTG 


GATCAGTCAC 


TGATGAAAGA 


TCTGAAGCAC 


AATTAGGATA 


TATCAAGAAT 


12360 


CTTAGTAAAC 


CTGCAAAAGC 


CGCAATAAGA 


ATAGCAATGA 


TATATACATG 


GGCATTTGGT 


12420 
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AATGATGAGA 


TATCTTGGAT 


GGAAGCCTCA 


CAGATAGCAC 


AAACACGTGC 


AAATTTTACA 


12480 


CTAGATAGTC 


TCAAAATTTT 


AACACCGGTA 


GCTACATCAA 


CAAATTTATC 


ACACAGATTA 


12540 


AAGGATACTG 


CAACTCA6AT 


GAAATTCTCC 


AGTACATCAT 


TGATCAGAGT 


CAGCAGATTC 


12600 


ATAACAATGT 


CCAATGATAA 


CATGTCTATC 


AAAGAAGCTA 


ATGAAACCAA 


AGATACTAAT 


12660 


CTTATTTATC 


AACAAATAAT 


GTTAACAGGA 


TTAAGTGTTT 


TCGAATATTT 


ATTTAGATTA 


12720 


AAAGAAACCA 


CAG6ACACAA 


CCCTATAGTT 


ATGCATCTGC 


ACATAGAAGA 


TGAGTGTTGT 


12780 


ATTAAAGAAA 


GTTTTAATGA 


TGAACATATT 


AATCCAGAGT 


CTACATTAGA 


ATTAATTCGA 


12840 


TATCCTG/^ 


GTAATGAATT 


TATTTATGAT 


AAAGACCCAC 


TCAAAGATGT 


GGACTTATCA 


12900 


AAACTTATGG 


TTATTAAAGA 


CCATTCTTAC 


AC7JVTTGATA 


TGAATTATTG 


GGATGATACT 


12960 


GACATCATAC 


ATGCAATTTC 


AATATGTACT 


GCAATTACAA 


TAGCAGATAC 


TATGTCACAA 


13020 


TTAGATCGAG 


ATAATTTAAA 


AGAGATAATA 


GTTATTGCAA 


ATGATGATGA 


TATTAATAGC 


13080 


TT7UVTCACTG 


AATTTTTGAC 


TCTTGACATA 


CTTGTATTTC 


TCAAGACATT 


TGGTGGATTA 


13140 


TTAGTAAATC 


AATTTGCATA 


CACTCTTTAT 


AGTCTAAAAA 


TAGAAGGTAG 


GGATCTCATT 


13200 


TGGGATTATA 


TAATGAGAAC 


ACTGAGAGAT 


ACTTCCCATT 


CAATATTAAA 


AGTATTATCT 


13260 


AATGCATTAT 


CTCATCCTAA 


AGTATTCAAG 


AGGTTCTGGG 


ATTGTGGAGT 


TTTAAACCCT 


13320 


ATTTATGGTC 


CTAATACTGC 


TAGTCAAGAC 


CAGATAAAAC 


TTGCCCTATC 


TATATGTGAA 


13380 


TATTCACTAG 


ATCTATTTAT 


GAGAGAATGG 


TTGAATGGTG 


TATCACTTGA 


AATATACATT 


13440 


TGTGACAGCG 


ATATGGAAGT 


TGCAAATGAT 


AGGAAACAAG 


CCTTTATTTC 


TAGACACCTT 


13500 


TCATTTGTTT 


GTTGTTTAGC 


AGAAATTGCA 


TCTTTCGGAC 


CTAACCTGTT 


AAACTT7ACA 


13560 


TACTTGGAGA 


GACTTGATCT 


ATTGAAACAA 


TATCTTGAAT 


TAAATATTAA 


AGAAGACCCT 


13620 


ACTCTTAAAT 


ATGTACATIAT 


ATCTGGATTA 


TTAATTAAAT 


CGTTCCCATC 


AACTGTAACA 


13680 


TACGTAAGAA 


AGACTGCAAT 


CAAATATCTA 


AGGATTCGCG 


GTATTAGTCC 


ACCTGAGGTA 


13740 


ATTGATGATT 


GGGATCCGGT 


AGAAGATGAA 


AATATGCTGG 


ATAACATTGT 


CAAAACTATA 


13800 


AATGATAACT 


GTAATAAAGA 


TAATAAAGGG 


AATAAAATTA 


ACAATTTCTG 


GGGACTAGCA 


13860 


CTTAAGAACT 


ATCAAGTCCT 


TAAAATCAGA 


TCTATAACAA 


GTGATTCTGA 


TGATAATGAT 


13920 


AGACTAGATG 


CTAATACTUIG 


TGGTTTGACA 


CTTCCTCAAG 


GAGGGAATTA 


TCTATCGCAT 


13980 


CAATTGAGAT 


TATTCGGAAT 


CAACAGCACT 


AGTTGTCTGA 


AAGCTCTTGA 


GTTATCACAA 


14040 


ATTTTAATGA 


AGGAAGTCAA 


TAAAGACAAG 


GACAGGCTCT 


TCCTGGGAGA 


AGGAGCAGGA 


14100 


GCTATGCTAG 


CATGTTATGA 


TGCCACATTA 


GGACCTGCAG 


TTAATTATTA 


TAATTCAGGT 


14160 


TTGAATATAA 


CAGATGTAAT 


TGGTCAACGA 


GAATTGAAAA 


TATTTCCTTC 


AGAGGTATCA 


14220 


TTAGTAGGTA 


AAAAATTAGG 


AAATGTGACA 


CAGATTCTTA 


ACAGGGTAAA 


AGTACTGTTC 


14280 


AATGGGAATC 


CTAATTCAAC 


ATGGATAGGA 


AATATGGAAT 


GTGAGAGCTT 


AATATGGAGT 


14340 


GAATTAAATG 


ATAAGTCCAT 


TGGATTAGTA 


CATTGTGATA 


TGGAAGGAGC 


TATCGGTAAA 


14400 


TCAGAAGAAA 


CTGTTCTACA 


TGAACATTAT 


AGTGTTATAA 


GAATTACATA 


CTTGATTGGG 


14460 


GATGATGATG 


TTGTTTTAGT 


TTCCAAAATT 


ATACCTACAA 


TCACTCCGAA 


TTGGTCTAGA 


14520 
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ATACTTTATC 


TATATATU^TT 


ATATTGGAAA 


GATGTAAGTA 


TAATATCACT 


CAAAACTTCT 


14580 


AATCCTGCAT 


CAACAGAATT 


ATATCTAATT 


TCGAAAGATG 


CATATTGTAC 


TATAATGGAA 


14640 


CCTAGTGAAA 


TTGTTTTATC 


AAAACTTAAA 


AGATTGTCAC 


TCTTGGAAGA 


AAATAATCTA 


14700 


TTAAAATG6A 


TCATTTTATC 


AAAGAAGAGG 


AATAATGAAT 


GGTTACATCA 


TGAAATCAAA 


14760 


GAA66AGAAA 


GAGATTATGG 


AATCATGAGA 


CCATATCATA 


TGGCACTACA 


AATCTTTGGA 


14820 


TTTCAAATCA 


ATTTAAATCA 


TCTGGCGAAA 


GAATTTTTAT 


CAACCCCAGA 


TCTGACTAAT 


14880 


ATCAACAATA 


TAATCCAAAG 


TTTTC7VGCGA 


ACAATAAAGG 


ATGTTTTATT 


TGAATGGATT 


14940 


AATATAACTC 


ATGATGATAA 


GAGACTVTAAA 


TTAGGCGGAA 


GATATAACAT 


ATTCCCACTG 


15000 


AAAAATAAGG 


GAAAGTTAAG 


ACTGCTATCG 


AGAAGACTAG 


TATTAAGTTG 


GATTTCATTA 


15060 


TCATTATCGA 


CTCGATTACT 


TACAGGTCGC 


TTTCCTGATG 


AAAAATTTGA 


ACATAGAGCA 


15120 


CAGACTGGAT 


ATGTATCATT 


AGCTGATACT 


GATTTAGAAT 


CATTAAAGTT 


ATTGTCGAAA 


15180 


AACATCATTA 


AGAATTACAG 


AGAGTGTATA 


GGATCAATAT 


CATATTGGTT 


TCTAACCAAA 


15240 


GAAGTTAAAA 


TACTTATGAA 


ATTGATCGGT 


GGTGCTAAAT 


TATTAGGAAT 


TCCCAGACAA 


15300 


TATAAAGAAC 


CCGAAGACCA 


GTTATTAGAA 


AACTACAATC 


AACATGATGA 


ATTTGATATC 


15360 


GATTAAAACA 


TAAATACAAT 


GAAGATATAT 


CCTAACCTTT 


ATCTTTAAGC 


CTAGGT^TAG 


15420 


ACAAAAAGTA 


AGAAAAACAT 


GTAATATATA 


TATACCAAAC 


AGAGTTCTTC 


TCTTGTTTGG 


15480 


TGGGTCGGCA 


TGGCATCTCC 


ACCTCCTCGC 


GGTCCGGACC 


TGGGCATCCG 


AAGGAGGACG 


15540 


CACGTCCACT 


CGGATGGCTA 


AGGGAGAGCC 


TGCAGTAGCA 


TAACCCCTTG 


GGGCCTCTAA 


15600 


ACGGGTCTTG 


AGGGGTTTTT 


TGCTGAAAGG 


AG6AACTATA 


TACGCGTCGA 


CGGGCCCCGC 


15660 


GCTCAC 












15666 



(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGATTTGCGC GCAATTTA/U^ TCATCTGG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCCAGGTCGG ACCGCGAGGA GGTGGAGATG CCATGCCAGC CCACCAAAAC AAGAGAAGAA 60 
CTCTGTTTGG 70 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 
GGCCCGTCGA CGCGTAATAC GACTCACTAT AGGACCAAAC AAGAG 45 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGCATCACG TGCTAC 16 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGTTGA6CC TATTTGATAC ATTTAATGCA CGTAGGOUVG AAAACATAAC AAAATCAGCC 60 

GGTGGAGCTA TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA 120 

ATAACTGATG ATAATGAGTUV AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT 180 

AATGAGAAAC AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT 240 

GCCAATCCAG AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGTCAA GTATGTCATA 300 

TACATGATTG AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA 360 

GAGATGATAT ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG 420 

GAAACTATGT TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT 480 

GGGTATCCAT CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT 540 
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ATCACTAGTA 


TCTCAGGGTT 


AAGAAAAGGC 


TTTTTCACCC 


GATTGGAAGC 


TTTCAGACAA 


600 


GAT66AACAG 


TGCAGGCAGG 


GCTGGTATTG 


AGCGGTGACA 


CAGTGGATCA 


GATTGGGTCA 


660 


ATCATGCGGT 


CTCAACAGAG 


CTTGGTAACT 


CTTATGGTTG 


AAACATTAAT 


AACAATGAAT 


720 


ACCAGCAGAA 


ATGACCTCAC 


AACCATAGAA 


AA6AATATAC 


AAATTGTTGG 


CAACTACATA 


780 


AGAGATGCAG 


GTCTCGCTTC 


ATTCTTCAAT 


ACAATCAGAT 


ATGGAATTGA 


GACCAGAATG 


840 


GCAGCTTTGA 


CTCTATCCAC 


TCTCAGACCA 


GATATCAATA 


GATTAAAAGC 


TTTGATGGAA 


900 


CTGTATTTAT 


CAAAGGGACC 


ACGCGCTCCT 


TTCATCTGTA 


TCCTCAGAGA 


TCCTATACAT 


960 


GGTGAGTTCG 


CACCA6GCAA 


CTATCCTGCC 


ATATGGAGCT 


ATGCAATGGG 


GGTGGCAGTT 


1020 


GTACAAAATA 


GAGCCATGCA 


ACAGTATGTG 


ACGGGAAGAT 


CATATCTAGA 


CATTGATATG 


X080 


TTCCAGCTAG 


GACAAGCAGT 


AGCACGTGAT 


GCCGAAGCTC 


AAATGAGCTC 


AACACTGGAA 


H40 


GATGAACTTG 


GAGTGACACA 


CGAATCTAAA 


GAAAGCTTGA 


AGAGACATAT 


AAGGAACATA 


1200 


AACAGTTCAG 


AGACATCTTT 


CCACAAACCG 


ACAGGTGGAT 


CAGCCATAGA 


GATGGCAATA 


1260 


GATGAAGAGC 


CAGAACAATT 


CGAACATAGA 


GCAGATCAAG 


AACAAAATGG 


AGAACCTCAA 


1320 


TCATCCATAA 


TTCAATATGC 


CTGGGCAGAA 


GGAAATAGAA 


GCGATGATCA 


GACTGAGCAA 


1380 


GCTACAGAAT 


CTGACAATAT 


CAAGACCGAA 


CAACAAAACA 


TCAGAGACAG 


ACTAAACAAG 


1440 


AGACTCAACG 


ACAAGAAGAA 


ACAAAGCAGT 


CAACCACCCA 


CTAATCCCAC 


AAACAGAACA 


1500 


AACCAGGACG 


AAATAGATGA 


TCTGTTTAAC 


GCATTTGGAA 


GCAACTAAGT 


CGACGATCCG 


1560 


GCTGCTAACA 


AAGCCCGAAA 


GGAAGCTGAG 


TTGGCTGCTG 


CCACCGCTGA 


GCAATAACTA 


1620 


GCATAACCCC 


TTGGGGCCTC 


TAAACGGGTC 


TTGAGGGGTT 


TTTTGCTGAA 


AGGAGGAACT 


1680 


ATATCCGGAT 


CGAGATCAAT 


TCTGTGAGCG 


TATGGCAAAC 


GAAGGAAAAA 


TAGTTATAGT 


1740 


AGCCGCACTC 


GATGGGACAT 


TTCAACGTAA 


ACCGTTTAAT 


AATATTTTGA 


ATCTTATTCC 


1800 


ATTATCTGAA 


ATGGTGGTAA 


AACTAACTGC 


TGTGTGTATG 


AAATGCTTTA 


AGGAGGCTTC 


1860 


CTTTTCTAAA 


CGATTGGGTG 


AGGAAACCGA 


GATAGAAATA 


ATAGGAGGTA 


ATGATATGTA 


1920 


TCAATCGGTG 


TGTAGAAAGT 


GTTACATCGA 


CTCATAATAT 


TATATTTTTT 


ATCTAAAAAA 


1980 


CTAAAAATAA 


ACATTGATTA 


AATTTTAATA 


TAATACTTAA 


AAATGGATGT 


TGTGTCGTTA 


2040 


GATAAACCGT 


TTATGTATTT 


TGAGGAAATT 


GATAATGAGT 


TAGATTACGA 


ACCAGAAAGT 


2100 


GCAAATGAGG 


TCGCAAAT^ 


ACTGCCGTAT 


CAAG6ACAGT 


TAAAACTATT 


ACTAGGAGAA 


2160 


TTATTTTTTC 


TTAGTAAGTT 


ACAGCGACAC 


GGTATATTAG 


ATGGTGCCAC 


CGTAGTGTAT 


2220 


ATAGGATCTG 


CTCCCGGTAC 


ACATATACGT 


TATTTGAGAG 


ATCATTTCTA 


TAATTTAGGA 


2280 


GTGATCATCA 


AATGGATGCT 


AATTGACGGC 


CGCCATCATG 


ATCCTATTTT 


AAATGGATTG 


2340 


CGTGATGTGA 


CTCTAGTGAC 


TCGGTTCGTT 


GATGAGGAAT 


ATCTACGATC 


CATCAAAAAA 


2400 


CAACTGCATC 


CTTCTAAGAT 


TATTTTAATT 


TCTGATQTGA 


GATCCAAACG 


AGGAGGAAAT 


2460 


GAACCTAGTA 


CGGCGGATTT 


ACTAAGTAAT 


TACGCTCTAC 


AAAATGTCAT 


GATTAGTATT 


2520 


TTAAACCCCQ 


TGGCGTCTAG 


TCTTAAATGG 


AGATGCCCGT 


TTCCAGATCA 


ATGGATCAAG 


2580 


GACTTTTATA 


TCCCACACGG 


TAATAATJVTG 


TTACAACCTT 


TTGCTCCTTC 


ATATTCAGGG 


2640 
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CCGTCGTTTT 


ACAACGTCGT 


GACTGGGAAA 


ACCCTGGCGT 


TACCCAACTT 


AATCGCCTTG 


2700 


CAGCACATCC 


CCCTTTCGCC 


AGCTGGCGTA ATAGCGAAGA GGCCCGCACC 


GATCGCCCTT 


2760 


CCCAACAGTT 


GCGCAGCCTG 


AATGGCGAAT 


GGCGCGACGC 


GCCCTGTAGC 


GGCGCATTAA 


2820 


GCGCGGCGGG 


TGTGGTGGTT 


ACGCGCAGCG 


TGACCGCTAC 


ACTTGCCAGC 


GCCCTAGCGC 


2880 


CCGCTCCTTT 


CGCTTTCTTC 


CCTTCCTTTC 


TCGCCACGTT 


CX3CCGGCTTT 


CCCCGTCAAG 


2940 


CTCTAAATCG 


GGGGCTCCCT 


TTAGGGTTCC 


GATTTAGTGC 


TTTACGGCAC 


CTCGACCCCA 


3000 


AAAAACTTGA 


TTAGGGTGAT 


GGTTCACGTA 


GTGGGCCATC 


GCCCTGATAG 


ACGGTTTTTC 


3060 


GCCCTTTGAC 


GTTGGAGTCC 


ACGTTCTTTA ATAGTGGACT 


CTTGTTCCAA 


ACTGGAACAA 


3120 


CACTCAACCC 


TATCTCGGTC 


TATTCTTTTG 


ATTTATAAGG 


GATTTTGCCG 


ATTTCGGCCT 


3180 


ATTGGTTAAA AAATGAGCTG 


ATTTAACAAA AATTTAACGC 


GAATTTTAAC 


AAAATATTAA 


3240 


CGTTTACAAT 


TTCCCAGGTG 


6CACTTTTCG 


GGGAAATGTG 


CGCGGAACCC 


CTATTTGTTT 


3300 


ATTTTTCTAA ATACATTCAA 


ATATGTATCC 


GCTCATGAGA 


CAATAACCCT 


GATAAATGCT 


3360 


TCAATAATAT 


T6AAAAA6GA 


AGAGTATGAG 


TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 


3420 


CTTTTTTGCG 


GCATTTTGCC 


TTCCTGTTTT 


TGCTCACCCA GAAACGCTGG 


TGAAAGTAAA 


3480 


AGATGCTGAA 


GATCAGTTGG 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


TCAACAGCGG 


3540 


TAAGATCCTT 


GAGAGTTTTC 


GCCCCGAAGA 


ACGTTTTCCA 


ATGATGAGCA 


CTTTTAAAGT 


3600 


TCTGCTATGT 


GGCGCGGTAT 


TATCCCGTAT 


TGACGCCGGG 


CAAGAGCAAC 


TCGGTCGCCG 


3660 


CATACACTAT 


TCTCAGAATQ 


ACTTGGTTGA 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


3720 


6GATGGCATG 


ACAGTAAGAG 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


3780 


GGCCAACTTA 


CTTCTGACAA 


CGATCGGAGG 


ACCGAA6GAG 


CTAACCGCTT 


TTTTGCACAA 


3840 


CATGGGGGAT 


CATGTAACTC 


GCCTTGATCG 


TTGGGAACCG 


GAGCTGAATG 


AAGCCATACC 


3900 


AAACGACGAG 


CGTGACACCA 


CGATGCCTGT 


AGCAATGGCA 


ACAACGTTGC 


GCAAACTATT 


3960 


AACTGGCGAA 


CTACTTACTC 


TAGCTTCCCG 


GCAACAATTA ATAGACTGGA 


TGGAGGCGGA 


4020 


TAAAGTT6CA 


GGACCACTTC 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA TTGCTGATAA 


4080 


ATCTGGAGCC 


GGTGAGCGTG 


GGTCTCGCGG 


TATCATTGCA 


GCACTGGGGC 


CAGATGGTAA 


4140 


GCCCTCCCGT 


ATCGTAGTTA 


TCTACACGAC 


GGGGAGTCAG 


GCAACTATGG 


ATGAACGAAA 


4200 


TAGACAGATC 


GCTGAGATAG 


GTGCCTCACT 


GATTAAGCAT 


TGGTAACTGT 


CAGACCAAGT 


4260 


TTACTCATAT 


ATACTTTAGA 


TTGATTTAAA 


ACTTCATTTT 


TAATTTAAAA 


GGATCTAGGT 


4320 


GAAGATCCTT 


TTTGATAATC 


TCATGACCAA 


AATCCCTTAA 


CGTGAGTTTT 


CGTTCCACTG 


4380 


AGCGTCAGAC 


CCCGTAGAAA 


AGATCAAAGG 


ATCTTCTTGA 


GATCCTTTTT 


TTCTGCGCGT 


4440 


AATCTGCTGC 


TTGCAAACAA 


AAAAACCACC 


GCTACCAGCG 


GTGGTTTGTT 


TGCCGGATCA 


4500 


AGA6CTACCA 


ACTCTTTTTC 


CGAAGGTAAC 


TGGCTTCAGC 


AGAGCGCAGA 


TACCAAATAC 


4560 


TGTCCTTCTA GTGTAGCCGT 


AGTTAGGCCA 


CCACTTCAAG AACTCTGTAG 


CACCGCCTAC 


4620 


ATACCTCX3CT 


CTGCTAATCC 


TGTTACCAGT 


GGCTGCTGCC 


AGTGGCOATA AGTCGTGTCT 


4680 


TACCGGGTTG 


GACTCAAGAC 


GATAGTTACC 


GGATAAGGCG 


CAGCGGTCGG 


GCTGAACGGG 


4740 
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GGGTTCGTGC 


ACACAGCCCA 


GCTTGGAGCG 


AACGACCTAC 


ACCGAACTGA 


GATACCTACA 


4800 


GCGTGAGCTA 


TGA6AAAGCG 


CCACGCTTCC 


CGAAGGGAGA 


AAGGCGGACA 


GGTATCCGGT 


4860 


AAGCGGCAGG 


GTCGGAACAG 


GAGAGCGCAC 


GAGGGAGCTT 


CCAGG6GGAA 


ACGCCTGGTA 


4920 


TCTTTATAGT 


CCTGTCGGGT 


TTCGCCACCT 


CTGACTTGAG 


CGTC6ATTTT 


TGTGATGCTC 


4980 


GTCAGGGGGG 


CGGAGCCTAT 


GGAAAAACGC 


CAGCAACGCG 


GCCTTTTTAC 


GGTTCCTGGC 


5040 


CTTTTGCTGG 


CCTTTTGCTC 


ACATGTTCTT 


TCCTGCGTTA 


TCCCCTGATT 


CTGTGGATAA 


5100 


CCGTATTACC 


GCCTTTGAGT 


GAGCTGATAC 


CGCTCGCCGC 


AGCCGAACGA 


CCGAGCGCAG 


5160 


CGAGTCAGTG 


AGCGAGGAAG 


C6GAAGAGCG 


CCCAATACGC 


AAACCGCCTC 


TCCCCGCGCG 


5220 


TTGGCCGATT 


CATTAATGCA 


GCTGGCACGA 


CAGGTTTCCC 


GACTGGAAAG 


CGGGCA6TGA 


5280 


GCGCAACGCA 


ATTAATGTGA 


GTTAGCTCAC 


TCATTAGGCA 


CCCCAGGCTT 


TACACTTTAT 


5340 


GCTTCCGGCT 


CGTATGTTGT 


GTGGAATTGT 


GAGCGGATAA 


CAATTTCACA 


CAGGAAACAG 


5400 


CTATGACCAT 


GATTACGCCA 


AGCTTTTGCG 


ATCAATAAAT 


GGATCACAAC 


CAGTATCTCT 


5460 


TAACGATGTT 


CTTCGCAGAT 


GATGATTCAT 


TTTTTAAGTA 


TTTGGCTAGT 


CAAGATGATG 


5520 


AATCTTCATT 


ATCTGATATA 


TTGCAAATCA 


CTCAATATCT 


AGACTTTCTG 


TTATTATTAT 


5580 


TGATCCAATC 


AAAAAATAAA 


TTAGAAGCCG 


TGGGTCATTG 


TTATGAATCT 


CTTTCAGAGG 


5640 


TikATACAGACA 


ATTGACAAAA 


TTCACAGACT 


TTCAAGATTT 


TAAAAAACTG 


TTTAACAAGG 


5700 


TCCCTATTGT 


TACAGATGGA 


AGGGTCAAAC 


TTAATAAAGG 


ATATTTGTTC 


GACTTTGTGA 


5760 


TTAGTTTGAT 


GCGATTCAAA 


AAAGAATCCT 


CTCTAGCTAC 


CACCGCAATA 


GATCCTGTTA 


5820 


GATACATAGA 


TCCTCGTCGC 


7ATATCGCAT 


TTTCTAACGT 


GATGGATATA 


TTAAAGTCGA 


5880 


ATAAAGTGAA 


CAATAATTAA 


TTCTTTATTG 


TCy^TCATGAA 


CGGCGGACAT 


ATTCAGTTGA 


5940 


TAATCGGCCC 


CATGTTTTCA 


GGTAAAAGTA 


CAGAATTAAT 


TAGACGAGTT 


AGACGTTATC 


6000 


AAATAGCTCA 


ATATAAATGC 


GTGACTATAA 


AATATTCTAA 


CX3ATAATAGA 


TACGGAACGG 


6060 


GACTATGGAC 


GCATGATAAG 


AATAATTTTG 


AAGCATTGGA 


AGCAACTAAA 


CTATGTGATG 


6120 


TCTTGGAATC 


AATTACAGAT 


TTCTCCGTGA 


TAGGTATCGA 


TGAAGGACAG 


TTCTTTCCAG 


6180 


ACATTGTTGA 


ATTGATCTCG 


ATCCCGCGAA 


ATTAATACGA 


CTCACTATAG 


GGAGACCACA 


6240 


ACGGTTTCCC 


TCTAGCGGGA 


TCAATTCCGC 


CCCTCTCCCT 


CCCCCCCCCC 


TAACGTTACT 


6300 


GGCCGAAGCC 


GCTTGGAATA 


AGGCCGGTGT 


GCGTTTGTCT 


ATATGTTATT 


TTCCACCATA 


6360 


TTGCCGTCTT 


TTGGCAATGT 


GAGGGCCCGG 


AAACCTGGCC 


CTGTCTTCTT 


GACGAGCATT 


6420 


CCTAGGGGTC 


TTTCCCCTCT 


CGCCAAAGGA 


ATGCAAGGTC 


TGTTG7UVTGT 


CGTGAAGGAA 


6480 


GCAGTTCCTC 


TGGAAGCTTC 


TTGAAGACAA 


ACAACGTCTG 


TAGCGACCCT 


TTGCAGGCAG 


6540 


CGGAACCCCC 


CACCTGGCGA 


CAGGTGCCTC 


TGCGGCCAAA 


AGCCACGTGT 


ATAA6ATACA 


6600 


CCTGCAAA6G 


CGGCACAACC 


CCAGTGCCAC 


GTTGTGAGTT 


GGATAGTTGT 


GGAAAGAGTC 


6660 


AAATGGCTCT 


CCTCAAGCGT 


ATTCAACAAG 


GGGCTGAAGG 


ATGCCCAGAA 


GGTACCCCAT 


6720 


TGTATGGGAT 


CTGATCTGGG 


GCCTCGGTGC 


ACATGCTTTA 


CATGTGTTTA 


GTCGAGGTTA 


6780 


AAAAACGTCT 


AGGCCCCCCG 


AACCACGGGG 


ACGTGGTTTT 


CCTTTGAAAA 


ACACGATAAT 


6840 
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ACC 6843 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



ATGGAAAGCG 


ACGCTAAAAA 


CTATCAAATC 


ATGGATTCTT 


GGGAAGAGGA ATCAAGAGAT 


60 


AAATCAACTA 


ATATCTCCTC 


GGCCCTCAAC 


ATCATTGAAT 


TCATACTCAG 


CACCGACCCC 


120 


CAAGAAGACT 


TATCGGAAAA 


CGACACAATC 


AACACAAGAA 


CCCAGCAACT 


CAGTGCCACC 


180 


ATCTGTCAAC 


CA6AAATCAA 


ACCAACAGAA 


ACAAGTGAGA 


AA6ATAGTG6 


ATCAACTGAC 


240 


AAAAATAGAC 


AGTCCGGGTC 


ATCACACGAA 


TGTACAACAG 


AAGCAAAAGA 


TAGAAATATT 


300 


GATCAGGAAA 


CTGTACAGAG 


AGGACCTGGG 


AGAAGAAGCA 


GCTCAGATAG 


TAGAGCTGAG 


360 


ACTGTGGTCT 


CTG6AGGAAT 


CCCCAGT^GC 


ATCACAGATT 


CTAAAAATGG 


AACCCAAAAC 


420 


ACG6AGGATA 


TTGATCTCAA 


TGAAATTAGA 


AAGATGGATA 


AGGACTCTAT 


TGAGGGGAAA 


480 


ATGCGACAAT 


CTGCAAATGT 


TCCAAGCGAG 


ATATCAGGAA 


GTGATGACAT 


ATTTACAACA 


540 


GAACAAAGTA 


GAAACAGTGA 


TCATGGAAGA 


AGCCTGGAAT 


CTATCAGTAC 


ACCTGATACA 


600 


AGATCAATAA 


GTGTTGTTAC 


TGCTGCAACA 


CCAGATGATG 


AAGAAGAAAT 


ACTAATGAAA 


660 


AATAGTAGGA 


CAAAGT^AAAG 


TTCTTCAACA 


CATCAAGAAG 


ATGACAAAAG 


AATTAAAAAA 


720 


GGGG6AAAAG 


GGAAA6ACTG 


GTTTAAGA7A 


TCAAAAGATA 


CCGACAACCA 


GATACCAACA 


780 


TCAGACTACA 


GATCCACATC 


AAAA6G6CA6 


7UVGAAAATCT 


CAAAGACAAC 


AACCACCAAC 


840 


ACCGACACAA 


AGGGGCAAAC 


AGAAATACA6 


ACAGAATCAT 


CAGAAACACA ATCCTCATCA 


900 


TGGAATCTCA 


TCATCGACAA 


CAACACCGAC 


CGGAACGAAC 


AGACAAGCAC 


AACTCCTCCA 


960 


ACAACAACTT 


CCAGATCAAC 


TTATACAAAA 


GAATCGATCC 


GAACAAACTC 


TGAATCCAAA 


1020 


CCCAAGACAC 


AAAAGACAAA 


TGGAAAGGAA 


AGGAAG6ATA 


CAGAAGAGA6 


CAATCGATTT 


1080 


ACAGAGAGGG 


CAATTACTCT 


ATTGCAGAAT 


CTTGGTGTAA 


TTCAATCCAC 


ATCAAAACTA 


1140 


GATTTATATC 


AAGACAAACG 


AGTTGTATGT 


GTAGCAAATG 


TACTAAACAA 


TGTAGATACT 


1200 


6CATCAAAGA 


TAGATTTCCT 


G6CAGGATTA 


GTCATAGGGG 


TTTCAATGGA 


CAACGACACA 


1260 


AAATTAACAC 


AGATACAAAA 


TGAAATGCTA 


AACCTCAAAG 


CAGATCTAAA 


GAAAATGGAC 


1320 


GAATCACATA 


GAAGATTGAT 


AGAAAATCAA 


AGAGAACAAC 


TGTCATTGAT 


CACGTCACTA 


1380 


ATTTCATUVTC 


TCAAAATTAT 


GACTGA6AGA 


GGAGGAAAGA 


AAGACCAAAA 


TGAATCCAAT 


1440 


6AGAGAGTAT 


CCATGATCAA 


AACAAAATTG 


AAA6AAGAAA 


AGATCAAGAA 


GACCAGGTTT 


1500 


GACCCACTTA 


TGGAGGCACA 


AGGCATTGAC 


AAGAATATAC 


CCGATCTATA 


TCGACATGCA 


1560 


GGAGATACAC 


TAGAGAACGA 


TGTACAAGTT 


AAATCAGAGA 


TATTAAGTTC 


ATACAATGAG 


1620 
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TCAAATGCAA 


CAAGACTAAT 


ACCCAAAAAA 


GTGAGCAGTA 


CAATGAGATC 


ACTAGTTGCA 


1680 


GTCATCAACA 


ACAGCAATCT 


ctcaca;uvgc 


ACAAAACAAT 


CATACATAAA 


CGAACTCAAA 


1740 


CGTTGCAAAA 


ATGATGAAGA 


AGTATCTGAA 


TTAATGGACA 


TGTTCAATGA AGATGTCAAC 


1800 


AATTGCCAAT 


GAGTCGACGA 


TCCGGCTGCT 


AACAAAGCCC 


GAAAGGAAGC 


TGAGTTGGCT 


1860 


GCTGCCACCG 


CTGAGCAATA 


ACTAGCATAA 


CCCCTTGGGG 


CCTCTAAACG 


GGTCTTGAGG 


1920 


GGTTTTTTGC 


TGAAAGGAGG 


AACTATATCC 


GGATCGAGAT 


CAATTCTGTG 


AGCGTATGGC 


1980 


AAACGAAGGA 


AAAATAGTTA 


TAGTAGCCGC 


ACTCGATGGG 


ACATTTCAAC 


GTAAACCGTT 


2040 


TAATAATATT 


TTGAATCTTA 


TTCCATTATC 


TGAAATGGTG 


GTAAAACTAA 


CTGCTGTGTG 


2100 


TATGAAATGC 


TTTAAGGAGG 


CTTCCTTTTC 


TAAACGATTG 


GGTGAGGAAA 


CCGAGATAGA 


2160 


AATAATAGGA 


GGTAATGATA 


TGTATCAATC 


GGTGTGTAGA 


AAGTGTTACA 


TCGACTCATA 


2220 


ATATTATATT 


TTTTATCTAA 


aaaacta;^ 


ATAAACATTG 


ATTAAATTTT 


AATATAATAC 


2280 


TTAAAAATGG 


ATGTTGTGTC 


GTTAGATAAA 


CCGTTTATGT 


ATTTTGAGGA 


AATTGATAAT 


2340 


GAGTTAGATT 


ACGAACCAGA 


AAGTGCAAAT 


GAGGTCGCAA 


AAAAACTGCC 


GTATCAAGGA 


2400 


CA6TTAAAAC 


TATTACTAGG 


AGAATTATTT 


TTTCTTAGTA 


AGTTACAGCG 


ACACGGTATA 


2460 


TTAGATGGTG 


CCACCGTAGT 


GTATATAGGA 


TCtGCTCCCG 


GTACACATAT 


ACGTTATTTG 


2520 


AGAGATCATT 


TCTATAATTT 


AGGAGTGATC 


ATCAAATGGA 


TGCTAATTGA 


CGGCCGCCAT 


2580 


CATGATCCTA 


TTTTAAATGG 


ATTGCGTGAT 


GTGACTCTAG 


TGACTCGGTT 


CGTTGATGAG 


2640 


GAATATCTAC 


GATCCATCAA 


AT^CAACTG 


CATCCTTCTA 


AGATTATTTT 


AATTTCTGAT 


2700 


GTGAGATCCA 


AACGAGGAGG 


AAATGAACCT 


AGTACGGCGG 


ATTTACTAAG 


TAATTACGCT 


2760 


CTACAAAATG 


TCATGATTAG 


TATTTTAAAC 


CCCGTGGCGT 


CTAGTCTTAA 


ATGGAGATGC 


2820 


CCGTTTCCAG 


ATCAATGGAT 


CAAGGACTTT 


TATATCCCAC 


ACGGTAATAA 


AATGTTACAA 


2880 


CCTTTTGCTC 


CTTCATATTC 


AGGGCCGTCG 


TTTTACAACG 


TCGTGACTGG 


GAAAACCCTG 


2940 


GCGTTACCCA 


ACTTAATCGC 


CTTGCAGCAC 


ATCCCCCTTT 


CGCCAGCTGG 


CGTAATAGCG 


3000 


AAGAGGCCCG 


CACCGATCGC 


CCTTCCCAAC 


AGTTGCGCAG 


CCTGAATGGC 


GAATGGCGCG 


3060 


ACGCGCCCTG 


TAGCGGCGCA 


TTAAGCGCGG 


CGGGTGTGGT 


GGTTACGCGC AGCGTGACCG 


3120 


CTACACTTGC 


CAGCGCCCTA 


GCGCCCGCTC 


CTTTCGCTTT 


CTTCCCTTCC 


TTTCTCGCCA 


3180 


CGTTCGCCGG 


CTTTCCCCGT 


CAAGCTCTAA 


ATCGGGGGCT 


CCCTTTAGGG 


TTCCGATTTA 


3240 


GTGCTTTACG 


GCACCTCGAC 


CCCAAAAAAC 


TTGATTAGGG 


TGATGGTTCA 


CGTAGTGGGC 


3300 


CATCGCCCTG 


ATAGACGGTT 


TTTCGCCCTT 


TGACGTTGGA 


GTCCACGTTC 


TTTAATAGTG 


3360 


GACTCTTGTT 


CCAAACTGGA 


ACAACACTCA 


ACCCTATCTC 


GGTCTATTCT 


TTTGATTTAT 


3420 


AAGGGATTTT 


GCCGATTTCG 


GCCTATTGGT 


TAAAAAATGA 


GCTGATTTAA 


CA7UAATTTA 


3480 


ACGCGAATTT 


TAACAAAATA 


TTAACGTTTA 


CAATTTCCCA 


GGTGGCACTT 


TTCGGGGAAA 


3540 


TGT6CGCGGA 


ACCCCTATTT 


GTTTATTTTT 


CTAAATACAT 


TCAAATATGT 


ATCCGCTCAT 


3600 


GAGACAATAA 


CCCTGATAAA 


TGCTTCAATA 


ATATTGAT^ 


AGGAAGAGTA 


TGAGTATTCA 


3660 


ACATTTCCGT 


GTCGCCCTTA 


TTCCCTTTTT 


TGCGGCATTT 


TGCCTTCCTG 


TTTTTGCTCA 


3720 
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CCCAGAAACG 


CTGGTGAAAG 


TAAAAGATGC 


TGAAGATCAG 


TTGGGTGCAC 


GAGTGGGTTA 


3780 


CATCGAACTG 


GATCTCAACA 


GCGGTAAGAT 


CCTTGAGAGT 


TTTCGCCCCG 


AAGAACGTTT 


3840 


TCCAATGATG 


AGCACTTTTA 


AAGTTCTGCT 


ATGTGGCGCG 


GTATTATCCC 


GTATTGACGC 


3900 


CGGGCAAGAG 


CAACTCGGTC 


GCCGCATACA 


CTATTCTCAG 


AATGACTTGG 


TTGAGTACTC 


3960 


ACCAGTCACA 


GAAAAGCATC 


TTACGGATGG 


CATGACAGTA 


AGAGAATTAT 


GCAGTGCTGC 


4020 


CATAACCATG 


AGTGATAACA 


CTGCGGCCAA 


CTTACTTCTG 


ACAACGATCG 


GAGGACCGAA 


4080 


GGAGCTAACC 


GCTTTTTTGC 


ACAACATGGG 


GGATCATGTA 


ACTCGCCTTG 


ATCGTTGGGA 


4140 


ACC6GAGCTG 


AATGAAGCCA 


TACCAAACGA 


CGAGCGTGAC 


ACCACGATGC 


CTGTAGCAAT 


4200 


GGCAACAACG 


TTGCGCAAAC 


TATTAACTGG 


CGAACTACTT 


ACTCTAGCTT 


CCCGGCAACA 


4260 


ATTAATAGAC 


TGGATGGAGG 


CGGATAAAGT 


TGCAGGACCA 


CTTCTGCGCT 


CGGCCCTTCC 


4320 


GGCTGGCTGG 


TTTATTGCTG 


ATAAATCTGG 


AGCCGGTGAG 


CGTGGGTCTC 


GCGGTATCAT 


4380 


TGCAGCACTG 


GGGCCAGATG 


GTAAGCCCTC 


CCGTAXCGTA 


GTTATCTACA 


CGACGGGGAG 


4440 


TCA6GCAACT 


ATGGATGAAC 


GAAATAGACA 


GATCGCTGAG 


ATAGGTGCCT 


CACTGATTAA 


4500 


GCATTGGTAA 


CTGTCAGACC 


AAGTTTACTC 


ATATATACTT 


TAGATTGATT 


TAAAACTTCA 


4560 


TTTTTAATTT 


AAAAGGATCT 


AGGTGAAGAT 


CCTTTTTGAT 


AATCTCATGA 


CCAAAATCCC 


4620 


TTAACGTGAG 


TTTTCGTTCC 


ACTGAGCGTC 


AGACCCCGTA 


GAAAAGATCA 


AAGGATCTTC 


4680 


TTGAGATCCT 


TTTTTTCTGC 


GCGTAATCTG 


CTGCTTGCAA 


ACAAAAAAAC 


CACCGCTACC 


4740 


AGCGGTGGTT 


TGTTTGCCGG 


ATCAAGAGCT 


ACCAACTCTT 


TTTCCGAAGG 


TAACTGGCTT 


4800 


CAGCAGAGCG 


CAGATACCAA 


ATACTGTCCT 


TCTAGTGTAG 


CCGTAGTTAG 


GCCACCACTT 


4860 


CAAGAACTCT 


GTAGCACCGC 


CTACATACCT 


CGCTCTGCTA 


ATCCTGTTAC 


CAGTGGCTGC 


4920 


TGCCAGTGGC 


GATAAGTCGT 


GTCTTACCGG 


GTTGGACTCA 


AGACGATAGT 


TACCGGATAA 


4980 


G6CGCAGCGG 


TCGGGCTGAA 


CGGGGGGTTC 


GTGCACACAG 


CCCAGCTTGG 


AGCGAACGAC 


5040 


CTACACCGAA 


CTGAGATACC 


TACAGCGTGA 


GCTATGAGAA 


AGCGCCACGC 


TTCCCGAAGG 


5100 


GAGAAAGGCG 


GACAGGTATC 


CGGTAAGCGG 


CAGGGTCGGA 


ACAGGAGAGC 


GCACGAGGGA 


5160 


GCTTCCAGGG 


GGAAACGCCT 


GGTATCTTTA 


TAGTCCTGTC 


GGGTTTCGCC 


ACCTCTGACT 


5220 


TGAGCGTCGA 


TTTTTGTGAT 


GCTCGTCAGG 


GGGGCGGAGC 


CTATGGAAAA 


ACGCCAGCAA 


5280 


CGCGGCCTTT 


TTACGGTTCC 


TGGCCTTTTG 


CTGGCCTTTT 


GCTCACATGT 


TCTTTCCTGC 


5340 


GTTATCCCCT 


GATTCTGTGG 


ATAACCGTAT 


TACCGCCTTT 


GAGTGAGCTG 


ATACCGCTCG 


5400 


CCGCAGCCGA 


ACGACCGAGC 


GCAGCGAGTC 


AGTGAGCGAG 


GAAGCG6AAG 


AGCGCCCAAT 


5460 


ACGCAAACCG 


CCTCTCCCCG 


CGCGTTGGCC 


GATTCATTAA 


TGCAGCTGGC 


ACGACAG6TT 


5520 


TCCCGACTGG 


AAAGCGGGCA 


GTGAGCGCAA 


CGCAATTAAT 


GTGAGTTAGC 


TCACTCATTA 


5580 


GGCACCCCAG 


GCTTTACACT 


TTATGCTTCC 


GGCTCGTATG 


TTGTGTGGAA 


TTGTGAGCGG 


5640 


ATAACAATTT 


CACACAGGAA 


ACAGCTATGA 


CCATGATTAC 


GCCAAGCTTT 


TGCGATCAAT 


5700 


AAATGGATCA 


CAACCAGTAT 


CTCTTAACGA 


TGTTCTTCGC 


AGATGATGAT 


TCATTTTTTA 


5760 


AGTATTTGGC 


TAGTCAA6AT 


GATGAATCTT 


CATTATCTGA 


TATATTGCAA 


ATCACTCAAT 


5820 
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ATCTAGACTT 


TCTGTTATTA 


TTATTQATCC 


AATCAAAAAA 


TAAATTAGAA 


GCCGTGGGTC 


5880 


ATTGTTATGA 


ATCTCTTTCA 


GAGGAATACA 


GACAATTGAC 


AAAATTCACA 


GACTTTCAAG 


5940 


ATTTTAAA7JV 


ACTGTTTAAC 


AAGGTCCCTA 


TTGTTACAGA 


TGGAAGGGTC 


AAACTTAATA 


6000 


AAGGATATTT 


GTTCGACTTT 


GTGATTAGTT 


TGATGCGATT 


CAAT^AAAGAA 


TCCTCTCTAG 


6060 


CTACCACCGC 


AATAGATCCT 


GTTAGATACA 


TAGATCCTCX3 


TCGCAATATC 


GCATTTTCTA 


6120 


ACGTGATGGA 


TATATTAAAG 


TCGAATAAAG 


TG7UVCAATAA 


TTAATTCTTT 


ATTGTCATCA 


6180 


TGAACGGCGG 


ACATATTCAG 


TTGATAATCG 


GCCCCATGTT 


TTCAGGTAAA AGTACAGAAT 


6240 


TAATTAGACG 


AGTTAGACGT 


TATCAAATAG 


CTCAATATAA 


ATGCGTGACT 


ATAAAATATT 


6300 


CTJ\ACGATAA 


TAGATACGGA 


ACGGGACTAT 


GGACGCAT6A 


TAAGAATAAT 


TTTGAAGCAT 


6360 


TGGAAGCAAC 


TAAACTATGT 


GATGTCTTGG 


AATCAATTAC 


AGATTTCTCC 


GTGATAGGTA 


6420 


TCGATGAAGG 


ACAGTTCTTT 


CCAGACATTG 


TTGAATTGAT 


CTCGATCCCG 


CGAAATTAAT 


6480 


AqpACTCACT 


ATAGGGAGAC 


CACAACGGTT 


TCCCTCTAGC 


GGGATCAATT 


CCGCCCCTCT 


6540 


CCCTCCCCCC 


CCCCTAACGT 


TACTGGCCGA 


AGCCGCTTGG 


AATAAGGCCG 


GTGTGCGTTT 


6600 


GTCTATATGT 


TATTTTCCAC 


CATATTGCCG 


TCTTTTGGCA ATGTGAGGGC 


CCG6AAACCT 


6660 


G6CCCTGTCT 


TCTTGACGAG 


CATTCCTAGG 


GGTCTTTCCC 


CTCTCGCCAA AG6AATGCAA 


6720 


GGTCTGTTGA 


ATGTCGTGAA 


GGAAGCAGTT 


CCTCTGGT^G 


CTTCTTGAAG 


ACAAACAACG 


6780 


TCTGTAGCGA 


CCCTTTGCAG 


GCAGC6GAAC 


CCCCCACCTG 


GCGACAGGTG 


CCTCTGCGGC 


6840 


CAAAAGCCAC 


GTGTATAAGA 


TACACCTGCA 


AAGGCGGCAC 


AACCCCAGTG 


CCACGTTGTG 


6900 


AGTTGGATAG 


TTGTGGAAAG 


AGTCAAATGG 


CTCTCCTCAA 


GCGTATTCAA 


CAAGGGGCTG 


6960 


AAGGATGCCC 


AGAAGGTACC 


CCATTGTATG 


GGATCTGATC 


TGGGGCCTCG 


GTGCACATGC 


7020 


TTTACATGTG 


TTTAGTCGAG 


GTTAAAAAAC 


GTCTAGGCCC 


CCCGTACCAC 


GGGGACGTGG 


7080 


TTTTCCTTTG 


AAAAACACGA 


TAATACC 








7107 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATGGACACTG AATCTAACAA TGGCACTGTA TCTGACATAC TCTATCCTGA GTGTCACCTT 60 

AACTCTCCTA TCGTTAAAGG TAAAATAGCA CAATTACACA CTATTATGAG TCTACCTCAG 120 

CCTTATGATA TGGATGACGA CTCAATACTA GTTATCACTA GACAGAAAAT AA/^CTTAAT 180 

AAATTGGATA AAAGACAACG ATCTATTAGA AGATTAAAAT TAATATTAAC TGAAAAAGTG 240 

AATGACTTAG GAAAATACAC ATTTATCAGA TATCCAGAAA TGTCAAAAGA AATGTTCAAA 300 

TTATATATAC CTGGTATTAA CAGTAAAGTG ACTGAATTAT TACTTAAAGC AGATAGAACA 360 
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TATAGTCAAA 


TGACTGATGG 


ATTAAGAGAT 


CTATGGATTA 


ATGTGCTATC 


AAAATTAGCC 


420 


TCAAAAAATG 


ATGGAAGCAA 


TTATGATCTT 


AATGAAGAAA 


TTAATAATAT 


ATCGAAAGTT 


480 


CACACAACCT 


ATAAATCAGA 


TAAATGGTAT 


AATCCATTCA 


AAACATGGTT 


TACTATCAAG 


540 


TATGATATGA 


GAAGATTACA 


AAAAGCTCGA AATGAGATCA 


CTTTTAATGT 


TGGGAAGGAT 


600 


TATAACTTGT TAGAAGACCA 


GAAGAATTTC 


TTATTGATAC 


ATCCAGAATT 


GGTTTTGATA 


660 


TTAGATAAAC 


AAAACTATAA 


TGGTTATCTA ATTACTCCTG 


AATTAGTATT 


GATGTATTGT 


720 


GACGTAGTCG 


AAGGCCGATG 


GAATATAAGT 


GCATGTGCTA 


AGTTAGATCC 


AAAATTACAA 


780 


TCTATGTATC 


AGAAAGGTAA 


TAACCTGTGG 


GAAGTGATAG 


ATAAATTGTT 


TCCAATTATG 


840 


G6AGAAAAGA 


CATTTGATGT 


GATATCGTTA 


TTAGAACCAC 


TTGCATTATC 


CTTAATTCAA 


900 


ACTCATGATC 


CTGTTAAACA 


ACTAAGAGGA 


GCTTTTTTAA 


ATCATGTGTT 


ATCCGAGATG 


960 


GAATTAATAT 


TTGAATCTAG 


AGAATCGATT 


AAGGAATTTC 


TGAGTGTAGA 


TTACATTGAT 


1020 


AAAATTTTAG 


ATATATTTAA 


TAAGTCTACA 


ATAGATGAAA 


TAGCAGAGAT 


TTTCTCTTTT 


1080 


TTTAGAACAT 


TTGGGCATCC 


TCCATTAGAA 


GCTAGTATTG 


CAGCAGAAAA 


GGTTAGAAAA 


1140 


TATATGTATA 


TTGGAAAACA ATTAT^AATTT 


GACACTATTA 


ATAAATGTCA 


TGCTATCTTC 


1200 


TGTACAATAA 


TAATTAACGG 


ATATAGAGAG 


AGGCATGGTG 


GACAGTGGCC 


TCCTGTGACA 


1260 


TTACCTGATC 


ATGCACACGA 


ATTCATCATA 


AATGCTTACG 


GTTCAAACTC 


TGCGATATCA 


1320 


TATGAAAATG 


CTGTTGATTA 


TTACCAGAGC 


TTTATAGGAA 


TAAAATTCAA 


TAAATTCATA 


1380 


GAGCCTCAGT 


TAGATGA6GA 


TTTGACAATT 


TATATGAAAG 


ATAAAGCATT 


ATCTCCAAAA 


1440 


AAATCA7UVTT 


6GGACACAGT 


TTATCCTGCA 


TCTAATTTAC 


TGTACCGTAC 


TAACGCATCC 


1500 


AACGAATCAC 


GAAGATTAGT 


TGAAGTATTT 


ATAGCAGATA 


GTAAATTTGA 


TCCTCATCAG 


1560 


ATATTGGATT 


ATGTAGAATC 


TGGGGACTGG 


TTAGATGATC 


CAG7ATTTAA 


TATTTCTTAT 


1620 


AGTCTTAAAG 


AAAAAGAGAT 


CAAACAGGAA 


GGTAGACTCT 


TTGCAAAAAT 


GACATACAAA 


1680 


ATGAGAGCTA 


CACAAGTTTT 


ATCAGAGACA 


CTACTTGCAA 


ATAACATAGG 


AAAATTCTTT 


1740 


CAAGAAAATG 


GGATGGTGAA 


GGGAGAGATT 


GAATTACTTA 


AGAGATTAAC 


AACCATATCA 


1800 


ATATCAGGAG 


TTCCACGGTA 


TAATGAAGTG 


TACAATAATT 


CTAAAAGCCA 


TACAGATGAC 


1860 


CTTAAAACCT 


ACAATAAAAT 


AAGTAATCTT 


AATTTGTCTT 


CTAATCAGAA 


ATCAAAG/UUV 


1920 


TTTGAATTCA AGTCAACGGA 


TATCTACAAT 


GATGGATACG 


AGACTGTGAG 


CTGTTTCCTA 


1980 


ACAACAGATC 


TCAAAAAATA 


CTGTCTTAAT 


TGGAGATATG 


AATCAACAGC 


TCTATTTGGA 


2040 


GAAACTTGCA 


ACCAAATATT 


TGGATTAAAT 


AAATTGTTTA 


ATTGGTTACA 


CCCTCGTCTT 


2100 


GAAGGAAGTA 


CAATCTATGT 


AGGTGATCCT 


TACTGTCCTC 


CATCAGATAA AGAACATATA 


2160 


TCATTAGAGG 


ATCACCCTGA 


TTCTGGTTTT 


TACGTTCATA ACCCAAGAGG 


GGGTATAGAA 


2220 


GGATTTTGTC 


AAAAATTATG 


GACACTCATA 


TCTATAAGTG 


CAATACATCT 


AGCAGCTGTT 


2280 


AGAATAGGCG 


TGAGG6TGAC 


TGCAATGGTT 


CAAGGAGACA 


ATCAAGCTAT 


AGCTGTAACC 


2340 


ACAAGAGTAC 


CCAACAATTA 


TGACTACAGA 


GTTAAGAAGG 


AGATAGTTTA 


TAAAGATGTA 


2400 


GTGAGATTTT 


TTGATTCATT 


AAGAGAAGTG 


ATGGATGATC 


TAGGTCATGA ACTTAAATTA 


2460 
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AATGAAACGA 


TTATAAGTAG 


CAAGATGTTC 


ATATATAGCA 


7UVAGAATCTA 


TTATGATGGG 


2520 


AGAATTCTTC 


CTCAAGCTCT 


AAAAGCATTA 


TCTAGATGTG 


TCTTCTGGTC 


AGAGACAGTA 


2580 


ATA6ACGAAA 


CAAGATCAGC 


ATCTTCAAAT 


TTGGCAACAT 


CATTTGCAAA 


AGCAATTGAG 


2640 


AATGGTTATT 


CACCT6TTCT 


AGGATATGCA 


TGCTCAATTT 


TTAAGAACAT 


TCAACAACTA 


2700 


TATATTGCCC 


TTGGGATGAA 


TATCAATCCA 


ACTATAACAC 


AGAATATCAG 


AGATCAGTAT 


2760 


TTTAGGAATC 


CAAATTGGAT 


GCAATATGCC 


TCTTTAATAC 


CTGCTAGTGT 


TGGGGGATTC 


2820 


AATTACATGG 


CCATGTCAAG 


ATGTTTTGTA 


AGGAATATTG 


GTGATCCATC 


AGTTGCCGCA 


2880 


TTGGCTGATA 


TTAA/^GATT 


TATTAAGGCG 


AATCTATTAG 


ACCGAAGTGT 


TCTTTATAGG 


2940 


ATTATGAATC 


AAGAACCAGG 


TGAGTCATCT 


TTTTTGGACT 


GGGCTTCAGA 


TCCATATTCA 


3000 


TGCAATTTAC 


CACAATCTCA 


AAATATAACC 


ACCATGATAA 


AAAATATAAC 


AGCAAGGAAT 


3060 


GTATTACAAG 


ATTCACCAAA 


TCCATTATTA 


TCTGGATTAT 


TCACAAATAC 


AATGATAGAA 


3120 


GAAGATGAAG 


AATTAGCTGA 


GTTCCTGATG 


GACAGGAAGG 


TAATTCTCCC 


TAGAGTTGCA 


3180 


CATGATATTC 


TAGATAATTC 


TCTCACAGGA 


ATTAGAAATG 


CCATAGCTGG 


AATGTTAGAT 


3240 


ACGACAAAAT 


CACTAATTCG 


GGTTGGCATA 


AATAGAGGAG 


GACTGACATA 


TAGTTTGTTG 


3300 


AGGAAAATCA 


GTAATTACGA 


TCTAGTACAA 


TATGAAACAC 


TAAGTAGGAC 


TTTGCGACTA 


3360 


ATTGTAAGTG 


ATAAAATCAA 


GTATGAAGAT 


ATGTGTTCGG 


TAGACCTTGC 


CATAGCATTG 


3420 


CGACAAAAGA 


TGTGGATTCA 


TTTATCAGGA 


GGAAGGATGA 


TAAGTGGACT 


TGAAACGCCT 


3480 


GACCCATTAG 


AATTACTATC 


TGGGGTAGTA 


ATAACAGGAT 


CAGAACATTG 


TAAAATATGT 


3540 


TATTCTTCAG 


ATGGCACAAA 


CCCATATACT 


TGGATGTATT 


TACCCGGTAA 


TATCAAAATA 


3600 


GGATCAGCAG 


AAACAGGTAT 


ATCGTCATTA 


AGAGTTCCTT 


ATTTTGGATC 


AGTCACTGAT 


3660 


GAAAGATCTG 


AAGCACAATT 


AGGATATATC 


AAGAATCTTA 


GTAAACCTGC 


AAAAGCCGCA 


3720 


ATAAGAATAG 


CAATGATATA 


TACATGGGCA 


TTTGGTAATG 


ATGAGATATC 


TTGGATGGAA 


3780 


GCCTCACAGA 


TAGCACAAAC 


ACGTGCAAAT 


TTTACACTAG 


ATAGTCTCAA 


AATTTTAACA 


3840 


CCGGTAGCTA 


CATCAACAAA 


TTTATCACAC 


AGATTAAAGG 


ATACTGCAAC 


TCAGATGAAA 


3900 


TTCTCCAGTA 


CATCATTGAT 


CAGAGTCAGC 


AGATTCATAA 


CAATGTCCAA 


TGATAACATG 


3960 


TCTATCAAAG 


AAGCTAATGA 


AACCAAAGAT 


ACTAATCTTA 


TTTATCT^CA 


AATAATGTTA 


4020 


ACAGGATTAA 


GTGTTTTCGA 


ATATTTATTT 


AGATTAAAAG 


AAACCACAGG 


ACACAACCCT 


4080 


ATAGTTATGC 


ATCTGCACAT 


AGAAGATGAG 


TGTTGTATTA 


AAGAAAGTTT 


TAATGATGAA 


4X40 


CATATTAATC 


CAGAGTCTAC 


ATTAGAATTA 


ATTCGATATC 


CTGAAAGTAA 


TGAATTTATT 


4200 


TATGATAAAG 


ACCCACTCAA 


AGATGTGGAC 


TTATCAAAAC 


TTATGGTTAT 


TAAAGACCAT 


4260 


TCTTACAC;^ 


TTGATATGAA 


TTATTGGGAT 


GATACTGACA 


TCATACATGC 


AATTTCAATA 


4320 


TGTACTGCAA 


TTACAATAGC 


AGATACTATG 


TCACAATTAG 


ATCGAGATAA 


TTTAAAAGAG 


4380 


ATAATAGTTA 


TTGCAAATGA 


TGATGATATT 


AATAGCTTAA 


TCACTGAATT 


TTTGACTCTT 


4440 


GACATACTTG 


TATTTCTCAA 


GACATTTGGT 


GGATTATTAG 


TAAATCAATT 


TGCATACACT 


4500 


CTTTATAGTC 


TAAAAATAGA 


AGGTAGGGAT 


CTCATTTGGG 


ATTATATAAT 


GAGAACACTG 


4560 
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AGAGATACTT 


CCCATTCAAT 


ATTAAAAGTA 


TTATCTAATG 


CATTATCTCA 


TCCTAAAGTA 


4620 


TTCAAGAGGT 


TCTGGGATTG 


TGGAGTTTTA 


AACCCTATTT 


ATGGTCCTAA 


TACTGCTAGT 


4680 


CAAGACCAGA 


TAAAACTTGC 


CCTATCTATA 


TGTGAATATT 


CACTAGATCT 


ATTTATGAGA 


4740 


GAATGGTTGA 


ATGGTGTATC 


ACTTGAAATA 


TACATTTGTG 


ACAGCGATAT 


GGAAGTTGCA 


4800 


AATGATAGGA 


AACAAGCCTT 


TATTTCTAGA 


CTICCTTTCAT 


TTGTTTGTTG 


TTTAGCAGAA 


4860 


ATTGCATCTT 


TCGGACCTAA 


CCTGTTAAAC 


TTAACATACT 


TGGAGAGACT 


TGATCTATTG 


4920 


AAACAATATC 


TTGAATTAAA 


TATTAAAGAA 


GACCCTACTC 


TTAAATATGT 


ACAAATATCT 


4980 


GGATTATTAA 


TTAAATCGTT 


CCCATCAACT 


GTAACATACG 


TAAGAAAGAC 


TGCAATCAAA 


5040 


TATCTAAGGA 


TTCGCGGTAT 


TAGTCCACCT 


GAGGTAATTG 


ATGATTGG6A 


TCCGGTAGAA 


5100 


GATGAAAATA 


TGCTGGATAA 


CATTGTCAAA 


ACTATAAATG 


ATAACTGTAA 


TAAAGATAAT 


5160 


AAAGGGAATA 


AAATTAACAA 


TTTCTGGGGA 


CTAGCACTTA 


AQAACTATCA 


AGTCCTTAAA 


5220 


ATCAGATCTA 


TAACAAGTGA 


TTCTGATGAT 


AATGATAGAC 


TAGATGCTAA 


TACAAGTGGT 


5280 


TTGACACTTC 


CTCAAGGAGG 


GAATTATCTA 


TCGCATCAAT 


TGAGATTATT 


CGGAATCAAC 


5340 


AGCACTAGTT 


GTCTGAAAGC 


TCTTGAGTTA 


TCACAAATTT 


TAATGAAGGA 


AGTCAATAAA 


5400 


GACAAGGACA 


GGCTCTTCCT 


GGGAGAAGGA 


GCAGGAGCTA 


TGCTAGCATG 


TTATGATGCC 


5460 


ACATTAGGAC 


CTGCAGTTAA 


TTATTATAAT 


TCAGGTTTGA 


ATATAACAGA 


TGTAATTGGT 


5520 


CAACGAGAAT 


TGAAAATATT 


TCCTTCAGAG 


GTATCATTAG 


TAGGTAAAAA 


ATTAGGAAAT 


5580 


GTGACACAQA 


TTCTTAACAG 


GGTAAAAGTA 


CTGTTCAATG 


GGAATCCTAA 


TTCAACATGG 


5640 


ATAGGAAATA 


TGGAATGTGA 


GAGCTTAATA 


TGGAGTGAAT 


TAAATGATAA 


GTCCATTGGA 


5700 


TTAGTACATT 


GTGATATGGA 


AGGAGCTATC 


GGTAAATCAG 


AAGAAACTGT 


TCTACATGAA 


5760 


CATTATAGTG 


TTATAAGAAT 


TACATACTTG 


ATTGGGGATG 


ATGATGTTGT 


TTTAGTTTCC 


5820 


AAAATTATAC 


CTACAATCAC 


TCCGAATTGG 


TCTAGAATAC 


TTTATCTATA 


TAAATTATAT 


5880 


TGGA7UVGATG 


TAAGTATAAT 


ATCACTCAAA 


ACTTCTAATC 


CTGCATCAAC 


AGAATTATAT 


5940 


CTAATTTCGA 


AAGATGCATA 


TTGTACTATA 


ATGGAACCTA 


GTGAAATTGT 


TTTATCAAAA 


6000 


CTTAAAAGAT 


TGTCACTCTT 


GGAAGAAAAT 


AATCTATTAA 


AATGGATCAT 


TTTATCAAAG 


6060 


AA6AGGAATA 


ATGAATGGTT 


ACATCATGAA 


ATCAMGAAG 


GAGAAAGAGA 


TTATGGAATC 


6120 


ATGAGACCAT 


ATCATATGGC 


ACTACAAATC 


TTTGGATTTC 


AAATCAATTT 


AAATCATCTG 


6180 


GCGAAAGAAT 


TTTTATCAAC 


CCCAGATCTG 


ACTAATATCA 


ACAATATAAT 


CCAAAGTTTT 


6240 


CAGCGAACAA 


TAAAGGATGT 


TTTATTTGAA 


TGGATTAATA 


TAACTCATGA 


TGATAAGAGA 


6300 


CATAAATTAG 


GCGGAAGATA 


TAACATATTC 


CCACTGAAAA 


ATAAGGGAAA 


GTTAAGACTG 


6360 


CTATCGAGAA 


GACTAGTATT 


AAGTTGGATT 


TCATTATCAT 


TATCGACTCG 


ATTACTTACA 


6420 


GGTCGCTTTC 


CTGATGAAAA 


ATTTGAACAT 


AGAGCACAGA 


CTGGATATGT 


ATCATTAGCT 


6480 


GATACTGATT 


TAGAATCATT 


AAAGTTATTG 


TCGAAAAACA 


TCATTAAGAA 


TTACAGAGAG 


6540 


TGTATAGGAT 


CAATATCATA 


TTGGTTTCTA 


ACCAAAGAAG 


TTAAAATACT 


TATGAAATTG 


6600 


ATTGGTGGTG 


CTAAATTATT 


AGGAATTCCC 


AGACAATATA 


AAGAACCCGA 


AGACCAGTTA 


6660 
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TTAGAAAACT 


ACAATCAACA 


TGATGAATTT 


GATATCGATT 


AAAACATAAA 


TACAATGTCG 


6720 


ACGATCCGGC 


TGCTAACAAA 


GCCCGT^GG 


AAGCTGAGTT 


GGCTGCTGCC 


ACCGCTGAGC 


6780 


AATAACTAGC 


ATAACCCCTT 


GGGGCCTCTA 


AACGGGTCTT 


GAGGGGTTTT 


TTGCTGAAAG 


6640 


GAGGAACTAT 


ATCCGGATCG 


AGATCAATTC 


TGTGAGCGTA 


TGGCAAACGA 


AGGAAAAATA 


6900 


GTTATAGTAG 


CCGCACTCGA 


TGGGACATTT 


CAACGTAAAC 


CGTTTAATAA 


TATTTTGAAT 


6960 


CTTATTCCAT 


TATCTGAAAT 


GGTGGTAAAA 


CTAACTGCTG 


TGTGTATGAA 


AT6CTTTAAG 


7020 


GAGGCTTCCT 


TTTCTAAACG 


ATTGGGTGAG 


GAAACCGAGA 


TAGAAATAAT 


AGGAGGTAAT 


7080 


GATATGTATC 


AATCGGTGTG 


TAGAAAGTGT 


TACATCGACT 


CATAATATTA 


TATTTTTTAT 


7140 


CTAAAAAACT 


AAAAATAAAC 


ATTGATTAAA 


TTTTAATATA 


ATACTTAAAA 


ATGGATGTTG 


7200 


TGTCGTTAGA 


TAAACCGTTT 


ATGTATTTTG 


AGGAAATTGA 


TAATGAGTTA 


GATTACGAAC 


7260 


CAGAAAGTGC 


AAATGAGGTC 


GCAAAAAAAC 


TGCCGTATCA 


AGGACAGTTA 


AAACTATTAC 


7320 


TAGGAGAATT 


ATTTTTTCTT 


AGTAAGTTAC 


AGCGACACGG 


TATATTAGAT 


GGTGCCACCG 


7380 


TAGTGTATAT 


AGGATCTGCT 


CCCX3GTACAC 


ATATACGTTA 


TTTGAGAGAT 


CATTTCTATA 


7440 


ATTTAGGAGT 


GATCATCAAA 


TGGATGCTAA 


TTGACGGCCX3 


CCATCATGAT 


CCTATTTTAA 


7500 


ATGGATTGCG 


TGATGTGACT 


CTAGTGACTC 


GGTTCGTTGA 


TGAGGAATAT 


CTACGATCCA 


7560 


TCAAATU^CA 


ACTGCATCCT 


TCTAAGATTA 


TTTTAATTTC 


TGATGTGAGA 


TCCAAACGAG 


7620 


GAGGAAATGA 


ACCTAGTACG 


GCGGATTTAC 


TAAGTAATTA 


CGCTCTACAA 


AATGTCATGA 


7680 


TTAGTATTTT 


AAACCCCGTG 


GCGTCTAGTC 


TTAAATGGAG 


ATGCCCGTTT 


CCAGATCAAT 


7740 


6GATCAAGGA 


CTTTTATATC 


CCACACGGTA 


ATAAAATGTT 


ACAACCTTTT 


GCTCCTTCAT 


7800 


ATTCAGGGCC 


GTCGTTTTAC 


AACGTCGTGA 


CTGGGAAAAC 


CCTGGCGTTA 


CCCAACTTAA 


7860 


TCGCCTTGCA 


GCACATCCCC 


CTTTC6CCAG 


CTGGCGTAAT 


AGCGAAGAGG 


CCCGCACCGA 


7920 


TCGCCCTTCC 


CAACAGTTGC 


GCAGCCTGAA 


TGGCGAATGG 


CGCGACGCGC 


CCTGTAGCGG 


7980 


CGCATTAAGC 


GCGGCGGGTG 


TGGTGGTTAC 


GCGCAGCGTG 


ACCGCTACAC 


TTGCCAGCGC 


8040 


CCTAGCGCCC 


GCTCCTTTCG 


CTTTCTTCCC 


TTCCTTTCTC 


GCCACGTTCG 


CCGGCTTTCC 


8100 


CCGTCAAGCT 


CTAAATCGGG 


GGCTCCCTTT 


AGGGTTCCGA 


TTTAGTGCTT 


TACGGCACCT 


8160 


CGACCCCAAA 


AAACTTGATT 


AGGGTGATGG 


TTCACGTAGT 


GGGCCATCGC 


CCTGATAGAC 


8220 


GGTTTTTCGC 


CCTTTGACGT 


TGGAGTCCAC 


GTTCTTTAAT 


AGTGGACTCT 


TGTTCCA/^C 


8280 


TGGAACAACA 


CTCAACCCTA 


TCTCGGTCTA 


TTCTTTTGAT 


TTATAAGGGA 


TTTTGCCGAT 


8340 


TTCGGCCTAT 


TGGTTAAT^ 


AT6AGCTGAT 


TTAACAAAAA 


TTTAACGCGA 


ATTTTAACAA 


8400 


AATATTAACG 


TTTACAATTT 


CCCAGGTGGC 


ACTTTTCGGG 


GAAATGTGCG 


CGGAACCCCT 


8460 


ATTTGTTTAT 


TTTTCTAAAT 


ACATTCAAAT 


ATGTATCCGC 


TCATGAGACA 


ATAACCCTGA 


8520 


TAAATGCTTC 


AATAATATTG 


AAAAAGGAAG 


AGTATGAGTA 


TTCTkACATTT 


CCGTGTCGCC 


8580 


CTTATTCCCT 


TTTTTGCGGC 


ATTTTGCCTT 


CCTGTTTTTG 


CTCACCCAGA 


AACGCTGGTG 


8640 


AAAGTAAAAG 


ATGCTGAAGA 


TCAGTTGGGT 


GCACGAGTGG 


GTTACATCX3A 


ACTGGATCTC 


8700 


AACAGCGGTA 


AGATCCTTGA 


GAGTTTTCGC 


CCCGAAGAAC 


GTTTTCCAAT 


GATGAGCACT 


8760 
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TTTAAAGTTC 


TGCTATGTGG 


CGCGGTATTA 


TCCCGTATTG 


ACGCCGGGCA 


AGAGCAACTC 


8820 


GGTCGCCGCA 


TACACTATTC 


TCAGAATGAC 


TTGGTTGAGT 


ACTCACCAGT 


CACAGAAAAG 


8880 


CyVTCTTACGG 


ATGGCATGAC 


AGTAAGAGAA 


TTATGCAGTG 


CTGCCATAAC 


CATGAGTGAT 


8940 


AACACTGCGG 


CCAACTTACT 


TCTGACAACG 


ATCGGAGGAC 


CGAAGGAGCT 


AACCGCTTTT 


9000 


TTGCACAACA 


TGGGGGATCA 


TGTAACTCGC 


CTT6ATCGTT 


GGGAACCGGA 


GCTGAATGAA 


9060 


GCCATACCAA 


ACGACGAGCG 


TGACACCACG 


ATGCCTGTAG 


CAATGGCAAC 


AACGTTGCGC 


9120 


AAACTATTAA 


CTGGCXIAACT 


ACTTACTCTA 


GCTTCCCGGC 


AACAATTAAT 


AGACTGGATG 


9180 


GAGGCGGATA AA6TTGCAGG 


ACCACTTCTG 


CGCTCGGCCC 


TTCCGGCTGG 


CTGGTTTATT 


9240 


GCTGATAAAT 


CTGGAGCCGG 


TGAGCGTGGG 


TCTCGCGGTA 


TCATTGCAGC 


ACTGGGGCCA 


9300 


GATGGTAAGC 


CCTCCCGTAT 


CGTAGTTATC 


TACACGACGG 


GGAGTCAGGC 


AACTATGGAT 


9360 


GAACGAAATA 


GACAGATCGC 


TGAGATAGGT 


GCCTCACTGA 


TTAAGCATTG 


GTAACTGTCA 


9420 


GACCAAGTTT 


ACTCATATAT 


ACTTTAGATT 


GATTTAAAAC 


TTCATTTTTA 


ATTTATIAAGG 


9480 


ATCTAGGTGA AGATCCTTTT 


TGATAATCTC 


ATGACCAAAA 


TCCCTTAACG 


TGAGTTTTCG 


9540 


TTCCACTGAG 


CGTCAGACCC 


CGTAGAAAAG 


ATCAAAGGAT 


CTTCTTGAGA 


TCCTTTTTTT 


9600 


CTGCGCGTAA 


TCTGCTGCTT 


GCAAACAAAA 


AAACCACCGC 


TACCAGCGGT 


GGTTTGTTTG 


9660 


CCGGATCAAG 


AGCTACCAAC 


TCTTTTTCCG 


AAGGTAACTG 


GCTTCAGCAG 


AGCGCAGATA 


9720 


CCAAATACTG 


TCCTTCTAGT 


GTAGCCGTAG 


TTAGGCCACC 


ACTTCAAGAA 


CTCTGTAGCA 


9780 


CCGCCTACAT 


ACCTCGCTCT 


GCTAATCCTG 


TTACCAGTGG 


CTGCTGCCAG 


TGGCGATAAG 


9840 


TCGTGTCTTA 


CCGGGTTGGA 


CTCAAGACGA 


TAGTTACCGG 


ATAAGGCGCA 


GC6GTCGGGC 


9900 


TGAACGGGG6 


GTTCGTGCAC 


ACAGCCCAGC 


TTGGAGCGAA 


CGACCTACAC 


CGAACTGAGA 


9960 


TACCTACAGC 


GTGAGCTATG 


AGAAAGCGCC 


ACGCTTCCCG 


AAGGGAGAAA 


GGCGGACAGG 


10020 


TATCCGGTAA 


GCGGCAGGGT 


CGGAACAGGA 


GAGCGCACGA 


GGGAGCTTCC 


AGGGGGAAAC 


10080 


GCCTGGTATC 


TTTATAGTCC 


TGTCGGGTTT 


CGCCACCTCT 


GACTTGAGCG 


TCGATTTTTG 


10140 


TGATGCTCGT 


CAGGGGGGCG 


GAGCCTATGG 


AAAAACGCCA 


GCAACGCGGC 


CTTTTTACGG 


10200 


TTCCTGGCCT 


TTTGCTGGCC 


TTTTGCTCAC 


ATGTTCTTTC 


CTGCGTTATC 


CCCTGATTCT 


10260 


GTGGATAACC GTATTACCGC 


CTTTGAGTGA 


GCTGATACCQ 


CTCX3CCGCAG 


CCGAACGACC 


10320 


GAGCGCAGCG AGTCAGTGAG 


CGAGGAAGCG 


GAAGAGCGCC 


CAATACGCAA 


ACCGCCTCTC 


10380 


CCCGCGCGTT 


GGCCGATTCA 


TTAATGCAGC 


TGGCACGACA 


GGTTTCCCGA 


CTGGAAAGCG 


10440 


GGCAGTGAGC 


GCAACGCAAT 


TAATGTGAGT 


TAGCTCACTC 


ATTAGGCACC 


CCAGGCTTTA 


10500 


CACTTTATGC 


TTCCGGCTCG 


TATGTTGTGT 


GGAATTGTGA 


GCGGATAACA 


ATTTCACACA 


10560 


GGAAACAGCT ATGACCATGA 


TTACGCCAAG 


CTTTTGCGAT 


CAATAAATGG 


ATCACAACCA 


10620 


GTATCTCTTA ACGATGTTCT 


TCGCAGATGA 


TGATTCATTT 


TTTAAGTATT 


TGGCTAGTCA 


10680 


AGATGATGAA 


TCTTCATTAT 


CTGATATATT 


GCAAATCACT 


CAATATCTAG 


ACTTTCTGTT 


10740 


ATTATTATTG 


ATCCAATCAA 


AAAATAAATT 


AGAAGCCGTG 


GGTCATTGTT 


ATGAATCTCT 


10800 


TTCAGAGGAA 


TACAGACAAT 


TGACTUUUITT 


CACAGACTTT 


CAAGATTTTA 


AAAAACTGTT 


10860 
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TAACAAGGTC 


CCTATTGTTA 


CAGATGGAAG 


GGTCAAACTT 


AATAAAGGAT 


ATTTGTTCGA 


10920 


CTTTGTGATT 


AGTTTGATGC 


GATTCAAAAA AGAATCCTCT 


CTAGCTACCA 


CCGCAATAGA 


10980 


TCCTGTTAGA 


TACATAGATC 


CTCGTCGCAA 


TATCGCATTT 


TCTAACGTGA 


TGGATATATT 


11040 


AAA6TCGAAT 


AAAGTGAACA 


ATAATTAATT 


CTTTATTGTC 


ATCATGAACG 


GCGGACATAT 


11100 


TCAGTTGATA 


ATCGGCCCCA 


TGTTTTCAGG 


TAAAAGTACA 


GAATTAATTA 


GACX3AGTTAG 


11160 


ACGTTATCAA 


ATAGCTCAAT 


ATAAATGCGT GACTATAAAA 


TATTCTAACG 


ATAATAGATA 


11220 


CGGAAC6GGA 


CTATGGAC6C 


ATGATAAGAA 


TAATTTTGAA 


GCATTGGAAG 


CAACTAAACT 


11280 


ATGTGATGTC 


TTGGT^TCAA 


TTACAGATTT 


CTCCGTGATA 


GGTATCGATG 


AAGGACAGTT 


11340 


CTTTCCAGAC 


ATTGTTGAAT 


TGATCTCGAT 


CCCGCGAAAT 


TAATACGACT 


CACTATAGGG 


11400 


AGACCACAAC 


GGTTTCCCTC 


TAGCGGGATC AATTCCGCCC 


CTCTCCCTCC 


CCCCCCCCTA 


11460 


ACGTTACTGG 


CCGAAGCCGC 


TTGGAATAAG 


GCCGGTGTGC 


GTTTGTCTAT 


ATGTTATTTT 


11520 


CCACCATATT 


QCCGTCTTTT 


GGCAATGTGA 


GGGCCCGGAA 


ACCTGGCCCT 


GTCTTCTTGA 


11580 


CGAGCATTCC 


TAGGGGTCTT 


TCCCCTCTCG 


CCAAAGGAAT 


GCAAGGTCTG 


TTGAATGTCG 


11640 


TGAA6GAAGC 


AGTTCCTCTG 


GAAGCTTCTT GAAGACAAAC 


AACGTCTGTA 


GCGACCCTTTT 


11700 


GCAGGCAGCG 


GAACCCCCCA 


CCTGGCGACA GGTGCCTCTG 


CGGCCAAAAG 


CCACGTGTAT 


11760 


AAGATACACC 


TGCAAAGGCG 


GCACAACCCC 


AGTGCCACGT 


TGTGAGTTGG 


ATAGTTGTGG 


11820 


TU^GAGTCAA 


ATGGCTCTCC 


TCAAGCGTAT 


TCAACAAGGG 


GCTGAAGGAT 


GCCCAGAAGG 


11880 


TACCCCATTG 


TATGGGATCT 


6ATCTGGGGC 


CTCGGTGCAC 


ATGCTTTACA 


TGTGTTTAGT 


11940 


CGAGGTTAAA 


AAACGTCTAG 


GCCCCCCGAA 


CCACGGGGAC 


GTGGTTTTCC 


TTTGAAAAAC 


12000 


ACGATAATAC 


C 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GATCGATGCT AGCCC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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GATCGGGCTA GCATC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TTACATGGCC AT 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
TCACATGGCG AT 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
TTTGGACTGG GC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28 
TTTTGATTGG GC 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
TGGTCCTAAT ACTG 14 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TGGGCCTAAT ATCG 14 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GCATTATCTA GATGTGTCTT CTGGTCAGAG 30 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CCTGAATTAT AATAATTAAC TGCAGGTCCT 30 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQX7ENCE DESCRIPTION: SEQ ID NO: 33 
GG67VAAGAAT CCAGAGACAA GAACGG 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GGTGAAGTTG TGGATCCATT TGATT6 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35 
CAACCTGTAA GGTACCAGCA TCCG 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36 
GATATGGTGT TAGGCCTTGA TCTGTTC 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37 
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CGCCATGGAA AAATCAGAGA TCCTCTTCT 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
CTGGATCCTA ATTGGAGTTG TTACCCATGT A 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
AACCATGGCT GAAAi\AGGGA AAA 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
GGTGAAGCTT AAGATGTGAT TTTACATATT TTA 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
AAATAGGATC CCTACAGATC ATTAGATATT AAAAT 
(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:42: 
CGCCATGGTG TTCAGTGCTT GTTG 24 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
CCACAAGCTT AATTAACCAT AATATGCATC A 31 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TTCCATGGAT TTGGATTTGT CTATTGGGT 29 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

ACCAAACAAG AGAAGAAACT TGCTTGGTAA TATAAATTTA ACTTAAAATT AACTTAGGAT 60 

TTAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120 

TATTTGATAC ATTTAATGCA CGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180 

TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240 

ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGJ\AAC 300 
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AACATGCACA 


AAGGGCAGGG 


TTCTTGGTGT 


CTTTATTGTC 


AATGGCTTAT 


GCCAATCCAG 


360 


AGCTCTACCT 


AACAACAAAT 


GGAAGTAATG 


CAGATGCCAA 


GTATGTCATA 


TACATGATTG 


420 


AGA/^GATCT 


AAAACGGCAA 


AAGTATGGAG 


GATTTGTGGT 


TAAGACGAGA 


GAGATGATAT 


480 


ATGAAAAGAC 


AACTGATTGG 


ATATTTGGAA 


GTGACCTGGA 


TTATGATCAG 


GAAACTATGT 


540 


TGCAGAACGG 


CAGGAACAAT 


TCAACAATTG 


AAGACCTTGT 


CCACACATTT 


GGGTATCCAT 


600 


CATGTTTAGG 


AGCTCTTATA 


ATACAGATCT 


GGATAGTTCT 


GGTCAAAGCT 


ATCACTAGTA 


660 


TCTCAGGGTT 


AAGAAAAGGC 


TTTTTCACCC 


GATTGGAAGC 


TTTCAGACAA 


GATGGAACAG 


720 


TGCAGGCAGG 


GCTGGTATTG 


AGCGGTGACA 


CAGTGGATCA 


GATTGGGTCA 


ATCATGCGGT 


780 


CTCAACAGAG 


CTTGGTAACT 


CTTATGGTTG 


AAACATTAAT 


AAC/VATGAAT 


ACCAGCAGAA 


840 


ATGACCTCAC 


AACCATAGAA 


AAGAATATAC 


AAATTGTTGG 


CAACTACATA 


AGAGATGCAG 


900 


GTCTCGCTTC 


ATTCTTCAAT 


ACAATCAGAT 


ATGGAATTGA 


GACCAGAATG 


GCAGCTTTGA 


960 


CTCTATCCAC 


TCTCAGACCA 


GATATCAATA 


GATTAAAAGC 


TTTGATGGTW^ 


CTGTATTTAT 


1020 


CAAAG6GACC 


AC6CGCTCCT 


TTCATCTGTA 


TCCTCAGAGA 


TCCTATACAT 


GGTGAGTTCG 


1080 


CACCAGGCAA 


CTATCCTGCC 


ATATGGAGCT 


ATGCAATGGG 


GGTGGCAGTT 


GTACAAAATA 


1140 


GAGCCATGCA 


ACAGTATGTG 


ACGGGAAGAT 


CATATCTAGA 


CATTGATATG 


TTCCAGCTAG 


1200 


GACAAGCAGT 


AGCACGTGAT 


GCCGAAGCTC 


AAATGAGCTC 


AACACTGGAA 


GATGAACTTG 


1260 


GAGTGACACA 


CGAAGCTAAA 


GAAAGCTTGA 


AGAGACATAT AAGGAACATA 


AACAGTTCAG 


1320 


AGACATCTTT 


CCACAAACCG 


ACAGGTGGAT 


CAGCCATAGA 


GATGGCAATA 


GATGAAGAGC 


1380 


CAGAACAATT 


CGAACATAGA 


GCAGATCAAG 


AACAAAATGG 


AGAACCTCAA 


TCATCCATAA 


1440 


TTCAATATGC 


CTGGGCAGAA 


GGAAATAGAA 


GCGATGATCA 


GACTGAGCAA 


GCTACAGAAT 


1500 


CTGACAATAT 


CAAGACCGAA 


CAACATUU^CA 


TCAGAGACAG 


ACTAAACAAG 


AGACTCAACG 


1560 


ACAAGAAGAA 


ACAAAGCAGT 


CAACCACCCA 


CTAATCCCAC 


AAACAGAACA 


AACCAGGACG 


1620 


AAATAGATGA 


TCTGTTTAAC 


GCATTTGGAA 


GCAACTAATC 


GAATCAACAT 


TTTAATCTAA 


1680 


ATCAATAATA 


AATAAGAAAA 


ACTTAGGATT 


AAAGAATCCT 


ATCATACCGG 


AATATAGGGT 


1740 


GGTAAATTTA 


GAGTCTGCTT 


G7UVACTCAAT 


CAATAGAGAG 


TTGATGGAAA 


GCGATGCTAA 


1600 


A/U^CTATCAA 


ATCATGQATT 


CTTGGGAAGA 


GGAATCAAGA 


GATAAATCAA 


CTAATATCTC 


1860 


CTCGGCCCTC 


AACATCATTG 


AATTCATACT 


CAGCACCGAC 


CCCCAAGAAG 


ACTTATCGGA 


1920 


AAACGACACA 


ATCAACACAA 


G7VACCCAGCA 


ACTCAGTGCC 


ACCATCTGTC 


AACCAGAAAT 


1980 


CAAACCAACA 


GAAACAAGTG 


AGAAAGATAG 


TGGATCAACT 


GACAAAAATA 


GACAGTCTGG 


2040 


GTCATCACAC 


GAATGTACAA 


CAGAAGCAAA 


AGATAGAAAC 


ATTGATCAGG 


AAACTGTACA 


2100 


GAGAGGACCT 


GGGAGAAGAA 


GCAGCTCAGA 


TAGTAGAGCT 


GAGACTGTGG 


TCTCTGGAGG 


2160 


AATCCCCAGA 


AGCATCACAG 


ATTCTAAAAA 


TGGAACCCAA 


AACACGGAGG 


ATATTGATCT 


2220 


CAATGAAATT 


AGAAAGATGG 


ATAAGGACTC 


TATTGAGGGG 


AAAATGCGAC 


AATCTGCAAA 


2280 


TGTTCCAAGC 


GAGATATCAG 


6AAGTGATGA 


CATATTTACA 


ACAGAACAAA 


GTAGAAACAG 


2340 


TGATCATGGA 


AGAAGCCTGG 


AATCTATCAG 


TACACCTGAT 


ACAAGATCAA 


TAAGTGTTGT 


2400 
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TACTGCTGCA 


ACACCAGATG 


ATGAAGAAGA 


AATACTAATG 


AAAAATAGTA 


GGACAAAGAA 


2460 


AAGTTCTTCA 


ACACATCAAG 


AAGATGACAA 


AA6AATTAAA 


AAAGGG6GAA 


AAGGGAAA6A 


2520 


CTGGTTTAAG 


7VAATCAAAAG 


ATACCGACAA 


CCAGATACCA ACATCAGACT 


ACAGATCCAC 


2580 


ATCAAAAGGG 


CAGAAGAAAA 


TCTCAAAGAC 


AACAACCACC 


AACACCGACA 


CAAAGGGGCA 


2640 


AACAGAAATA 


CAGACAGAAT 


CATCAGAAAC 


ACAATCCTCA 


TCATGGAATC 


TCATCATCGA 


2700 


CAACAACACC 


GACCGGAAC6 


AACAGACAAG 


CACAACTCCT 


CCAACAACAA 


CTTCCAGATC 


2760 


AACTTATACA 


AAAGAATCGA 


TCCGAACAAA 


CTCTGAATCC 


AAACCCAAGA 


CACAAAAGAC 


2820 


AAATGGAAAG 


GA7VAGGAAGG 


ATACAGAAGA 


GAGCAATCGA 


TTTACAGAGA 


GGGCAATTAC 


2880 


TCTATTGCAG 


AATCTTGGTG 


TAATTCAATC 


CACATCAAAA 


CTAGATTTAT 


ATCAAGACT^ 


2940 


ACGAGTTGTA 


TGTGTAGCAA ATGTACTAAA 


CAATGTAGAT 


ACTGCATCAA 


AGATAGATTT 


3000 


CCTGGCAGGA 


TTAGTCATAG 


GGGTTTCAAT 


GGACAACGAC 


ACAAAATTAA 


CACAGATACA 


3060 


AAATGAAATG 


CTAAACCTCA 


AAGCAGATCT 


AAAGAAAATG 


GACGAATCAC 


ATAGAAGATT 


3120 


GATAGAAAAT 


CTVAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 


3180 


TATGACTGAG 


AGAGGAGGAA 


AGAAAGACCA 


AAATGAATCC 


AATGAGA6AG 


TATCCATGAT 


3240 


CAAAACAAAA 


TTGAAAGAAG 


AAAAGATCAA 


GAAGACCAGG 


TTTGACCCAC 


TTATGGAGGC 


3300 


ACAAGGCATT 


GACAAGAATA 


TACCCGATCT 


ATATCGACAT 


GCAGGAGATA 


CACTAGAGAA 


3360 


CGATGTACAA 


GTTAAATCAG 


AGATATTAAG 


TTCATACAAT 


GAGTCAAATG 


CAACAAGACT 


3420 


AATACCCAAA 


AAAGTGAGCA 


GTACAATGAG 


ATCACTAGTT 


GCAGTCATCA 


ACAACAGCAA 


3480 


TCTCTCACAA 


AGCACAAAAC 


AATCATACAT 


AAACGAACTC 


AAACGTTGCA 


AAAATGATGA 


3540 


AGAAGTATCT 


GAATTAATGG ACATGTTCAA TGAAGATGTC 


AACAATTGCC 


AATGATCCAA 


3600 


CAAAGAAACG 


ACACCGAACA AACAGACAAG 


AAACAACAGT 


AGATCAAAAC 


CTGTCAACAC 


3660 


ACACAAAATC 


AAGCAGAATG 


AAACAACAGA 


TATCAATCAA 


TATACAAATA 


AGAAAAACTT 


3720 


AGGATTAAAG 


AATAAATTAA 


TCCTTGTCCA 


AAATGAGTAT 


AACTAACTCT 


GCAATATACA 


3780 


CATTCCCAGA 


ATCATCATTC 


TCTGAAAATG 


GTCATATAGA ACCATTACCA 


CTCAAAGTCA 


3840 


ATGAACAGAG 


GAAAGCAGTA 


CCCCACATTA 


GAGTTGCCAA QATCGGAAAT 


CCACCAAAAC 


3900 


ACGGATCCCG 


GTATTTAGAT 


GTCTTCTTAC 


TCGGCTTCTT 


CGAGATGGAA 


CGAATCAAAG 


3960 


ACAAATACGG 


GAGTGTGAAT 


GATCTCGACA 


GTGACCCGAG 


TTACAAAGTT 


TGTGGCTCTG 


4020 


GATCATTACC 


AATCGGATTG 


GCTAAGTACA 


CTGGGAATGA 


CCAGGAATTG 


TTACAAGCCG 


4080 


CAACCAAACT 


G6ATATAGAA 


GTGAGAAGAA 


CAGTCAAAGC 


GAAAGAGATG 


GTTGTTTACA 


4140 


CG6TACAAAA 


TATAAAACCA 


GAACTGTACC 


CATGGTCCAA 


TAGACTAAGA 


AAAGGAATGC 


4200 


TGTTCGATGC 


CAACAAAGTT 


GCTCTTGCTC 


CTCAATGTCT 


TCCACTAGAT 


AGGAGCATAA 


4260 


AATTTAGAGT 


AATCTTGGTG 


AATTGTACGO 


CAATTGGATC 


AATAACCTTG 


TTCAAAATTC 


4320 


CTAAGTCAAT 


GGCATCACTA 


TCGTTAACCA 


ACACAATATC 


AATCAATCTG 


CAGGTACACA 


4380 


TAAAAACAGG 


GGTTCAGACT 


GATTCTAAAG 


GGATAGTTCA 


AATTTTG6AT 


GAGAAAGGCG 


4440 


AAAAATCACT 


GAATTTCATG 


GTCCATCTCG 


GATTGATCAA AA6AAAAGTA 


GGCAQAATGT 


4500 
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ACTCTGTTGA 


ATACTGTAAA 


CAGAAAATCX3 


AGAAAATGAG 


ATTGATATTT 


TCTTTAGGAC 


4560 


TAGTTGGAGG 


AATCAGTCTT 


CATGTCAATG 


CAACTGGGTC 


CATATCAAAA 


ACACTAGCAA 


4620 


GTCAGCTGGT 


ATTCAAAAGA 


GAGATTTGTT 


ATCCTTTAAT 


GGATCTAAAT 


CCGCATCTCA 


4680 


ATCTAGTTAT 


CTGGGCTTCA 


TCAGTAGAGA 


TTACAAGAGT 


GGATGCAATT 


TTCCAACCTT 


4740 


CTTTACCTGG 


CGAGTTCAGA 


TACTATCCTA 


ATATTATTGC 


AAAAGGAGTT 


GGGAAAATCA 


4800 


AACAAT6GAA 


CTAGTAATCT 


CTATTTTAGT 


CCGGACGTAT 


CTATTAAGCC 


GAAGCAAATA 


4860 


AAGGAT7VATC 


AAAAACTTAG 


GACAAAAGAG 


GTCAATACCA ACAACTATTA 


GCAGTCACAC 


4920 


TCGCAAGAAT 


AAGAGAGAAG 


6GACCAAAAA AGTCAAATAG 


GAGAAATCAA 


AACAAAAGGT 


4980 


ACAGAACACC 


AGAACAACAA 


AATCAAAACA 


TCCAACTCAC 


TCAAAACAAA 


AATTCCAAAA 


5040 


GAGACCGGCA 


ACACAAC7U\G 


CACTGAACAC 


AATGCCAACT 


TCAATACTGC 


TAATTATTAC 


5100 


AACCATGATC 


ATGGCATCTT 


TCTGCCAAAT 


AGATATCACA AAACTACAGC 


ACGTAGGTGT 


5160 


ATTGGTCAAC 


AGTCCCAAAG 


GGATGAAGAT 


ATCACAAAAC 


TTTGAAACAA 


GATATCTAAT 


5220 


TTTGAGCCTC 


ATACCAAAAA 


TAGAAGACTC 


TAACTCTTGT 


GGTGACCAAC 


AGATCAAGCA 


5280 


ATACAAGAAG 


TTATTGGATA 


GACTGATCAT 


CCCTTTATAT 


GATGGATTAA 


GATTACAGAA 


5340 


AGATGTGATA 


GTAACCAATC 


AAGAATCCAA 


TGAAAACACT 


GATCCCAGAA 


CAAAACGATT 


5400 


CTTTGGAGGG 


GTAATTGGAA 


CCATTGCTCT 


GGGAGTAGCA 


ACCTCAGCAC 


AAATTACAGC 


5460 


GGCAGTTGCT 


CTGGTTGAAG 


CCAAGCAGGC 


AAGATCAGAC 


ATCGAAAAAC 


TCAAAGAAGC 


5520 


AATTA6GGAC 


ACAAACAAAG 


CAGTGCAGTC 


AGTTCAGAGC 


TCCATAGGAA 


ATTTTUITAGT 


5580 


AGCAATTAAA 


TCAGTCCAGG 


ATTATGTTAA 


CAAA6AAATC 


GTGCCATCGA 


TTGCGAGGCT 


5640 


A6GTTGTGAA 


6CAGCAGGAC 


TTCAATTAGG 


AATTGCATTA 


ACACAGCATT 


ACTCAGAATT 


5700 


AACAAACATA 


TTTGGTGATA 


ACATAGGATC 


GTTACAAGAA 


AAAGGAATAA 


AATTACAAGG 


5760 


TATAGCATCA 


TTATACCGCA 


CAAATATCAC 


AGAAATATTC 


ACAACATCAA 


CAGTTGATAA 


5820 


ATATGATATC 


TATGATCTGT 


TATTTACAGA ATCAATAAAG GTGAGAGTTA 


TAGATGTTGA 


5880 


CTTGAATGAT 


TACTCAATCA 


CCCTCCAAGT 


CAGACTCCCT 


TTATTAACTA 


GGCTGCTGAA 


5940 


CACTCAGATC 


TACAAAGTAG 


ATTCCATATC 


ATATAACATC 


CAAAACAGAG 


AATGGTATAT 


6000 


CCCTCTTCCC 


AGCCATATCA 


TGACGAAAGG 


GGCATTTCTA 


GGTGGAGCAG 


ACGTCAAAGA 


6060 


ATGTATAGAA 


GCATTCAGCA 


GCTATATATG 


CCCTTCTGAT 


CCAGGATTTG 


TATTAAACCA 


6120 


TGAAATAGAG 


AGCTGCTTAT 


CAGGAAACAT 


ATCCCAATGT 


CCAAGAAC7VA 


CGGTCACATC 


6180 


AGACATTGTT 


CCAAGATATG 


CATTTGTCAA TGGAGGAGTG 


GTTGCAAACT 


GTATAACAAC 


6240 


CACCTGTACA 


TGCAACGGAA 


TTGGTAATAG 


AATCAATC7U\ 


CCACCTGATC 


AAGGAGTAAA 


6300 


AATTATAACA 


CATAAAGAAT 


GTAGTACTGT 


GGGTATCAAC 


GGAATGCTGT 


TCAATACAAA 


6360 


TAAAGAAGGA 


ACTCTTGCAT 


TCTATACACC 


AAATGATATA ACACTAAACA 


ATTCTGTTAC 


6420 


ACTGGATCCA 


ATTGACATAT 


CAATCGA6CT 


CAACAAGGCC 


AAATCAGATC 


TAGAAGAATC 


6480 


AAAAGAATG6 


ATAAGAAGGT 


CAAATCAAAA ACTAGATTCT ATTGGAAATT 


GGCATCAATC 


6540 


TAGCACTACA 


ATCATAATTA TTTTGATAAT 


GATCATTATA 


TTGTTTATAA 


TTAATATAAC 


6600 
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GATAATTACA 


ATTGCAATTA AGTATTACAG 


AATTCAAAAG 


AGAAATCGAG 


TGGATCAAAA 


6660 


T6ACAAGCCA 


TATGTACTAA 


CAAACAAATA 


ACATATCTAC 


AGATCATTAG 


ATATTAAAAT 


6720 


TATAAAAAAC 


TTAGGA6TAA 


AGTTACX3CAA 


TCCAACTCTA 


CTCATATAAT 


TGAGGAAGGA 


6780 


CCCAATAGAC 


AAATCCAAAT 


TCGAGATGGA ATACTGGAAG 


CATACCAATC 


ACGGAAAGGA 


6840 


T6CTGGTAAT 


GAGCTGGA6A 


CGTCTATGGC 


TACTCATGGC 


7ACAAGCTCA 


CTAATAAGAT 


6900 


AATATACATA 


TTATGGACAA 


TAATCCTGGT 


GTTATTATCA 


ATAGTCTTCA 


TCATAGTGCT 


6960 


AATTAATTCC 


ATCAAAAGTG 


AAAAGGCCCA 


CGAATCATTG 


CTGCAAGACA 


TAAATAATGA 


7020 


GTTTATGGAA 


ATTACAGAT^ 


AGATCCAAAT 


GGCATCGGAT 


AATACCAATG 


ATCTAATACA 


7080 


GTCAGGAGTG 


AATACAAGGC 


TTCTTACAAT 


TCAGAGTCAT 


GTCCAGAATT 


ACATACCAAT 


7140 


ATCATTGACA 


CAACAGATGT 


CAGATCTTAG 


GAAATTCATT 


AGTGAAATTA 


CAATTAGAAA 


7200 


TGATAATCAA GAAGTGCTGC 


CACAAAGAAT 


AACACATGAT 


GTAGGTATAA 


AACCTTTAAA 


7260 


TCCAGATGAT 


TTTTGGAGAT 


GCACGTCTGG 


TCTTCCATCT 


TTAATGAAAA 


CTCCAAAAAT 


7320 


AAGGTTAATG 


CCAGGGCCGG 


GATTATTAGC 


TATGCCAACG 


ACTGTTGATG 


GCTGTGTTAG 


7380 


AACTCCGTCT 


TTAGTTATAA 


ATGATCTGAT 


TTATGCTTAT 


ACCTCAAATC 


TAATTACTCG 


7440 


AGGTTGTCAG 


GATATAGGAA 


AATCATATCA 


AGTCTTACAG 


ATAGGGATAA 


TAACTGTAAA 


7500 


CTCAGACTTG 


GTACCTGACT 


TAAATCCTAG 


GATCTCTCAT 


ACCTTTAACA 


TAJ\ATGACAA 


7560 


TAGGAA6TCA 


TGTTCTCTAG 


CACTCCTAAA TATAGATGTA 


TATCAACTGT 


GTTCAACTCC 


7620 


CAAAGTTGAT 


GAAAGATCAG 


ATTATGCATC 


ATCAGGCATA 


GAAGATATTG 


TACTTGATAT 


7680 


TGTCAATTAT 


GATGGTTCAA 


TCTCAACAAC 


AAGATTTAAG 


AATAATAACA 


TAAGCTTTGA 


7740 


TCAACCATAT 


GCTGCACTAT 


ACCCATCTGT 


TGGACCAGGG 


ATATACTACA 


AAGGCTU^T 


7800 


AATATTTCTC 


GGGTATGGAG 


GTCTTGAACA 


TCCAATAAAT 


6AGAATGTAA 


TCTGCAACAC 


7860 


AACTGGGTGC 


CCCGGGAAAA 


CACAGAGAGA 


CTGTAATCAA 


GCATCTCATA 


GTACTTGGTT 


7920 


TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTGGCTGAC AAAGGCTTAA ACTCAATTCC 


7980 


AAAATTGAAA GTATGGACGA 


TATCTATGCG ACAAAATTAC 


TGGGGGTCAG 


AAGGAAGGTT 


8040 


ACTTCTACTA 


GGTAACAAGA 


TCTATATATA 


TACAAGATCT 


ACAAGTTGGC 


ATAGCAAGTT 


8100 


ACAATTA6GA 


ATAATTGATA 


TTACTGATTA 


CAGTGATATA 


AGGATAAAAT 


GGACATGGCA 


8160 


TAATGTGCTA 


TCAAGACCAG 


GAAACAATGA ATGTCCATGG 


GGACATTCAT 


GTCCAGATGG 


8220 


ATGTATAACA 


GGAGTATATA 


CTGATGCATA 


TCCACTCAAT 


CCCACAGGGA 


GCATTGTGTC 


8280 


ATCTGTCATA 


TTAGACTCAC 


AAAAATCGAG 


AGTGAACCCA 


GTCATAACTT 


ACTCAACAGC 


8340 


AACCGAAAGA 


GTAAACGAGC 


TGGCCATCCT 


AAACAGAACA 


CTCTCAGCTG 


GATATACAAC 


8400 


AACAAGCTGC 


ATTACACACT 


ATAACAAAGG 


ATATTGTTTT 


CATATAGTAG 


AAATAAATCA 


8460 


TAAAAGCTTA 


AACACATTTC 


AACCCATGTT 


GTTCAAAACA 


GAGATTCCAA 


AAAGCTGCAG 


8520 


TTAATCATAA 


TTAACCATAA 


TATGCATCAA 


TCTATCTATA 


ATACAAGTAT 


ATGATAAGTA 


8580 


ATCAGCAATC 


AGACAATAGA 


CAAAAGGGAA ATATAAAAAA 


CTTAGGAGCA 


AAGCGTGCTC 


8640 


GGGAAAT6GA 


CACTGAATCT 


AACAATGGCA 


CTGTATCTGA 


CATACTCTAT 


CCTGAGTGTC 


8700 
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ACCTTAACTC 


TCCTATCGTT 


AAAGGTAAAA 


TAGCACAATT 


ACACACTATT 


ATGAGTCTAC 


8760 


CTCAGCCTTA 


TGATATGGAT 


GACGACTCAA 


TACTAGTTAT 


CACTAGACAG 


AAAATAAAAC 


8820 


TTAATAAATT 


GGATAAAAGA 


CAACGATCTA 


TTAGAAGATT 


AAAATTAATA 


TTAACTGAAA 


8880 


AAGT6AATGA 


CTTAGGAAAA 


TACACATTTA 


TCAGATATCC 


AGAAATGTCA 


AAAGAAATGT 


8940 


TCAAATTATA 


TATACCTGGT 


ATTAACAGTA 


AAGTGACTGA 


ATTATTACTT 


J\AAGCAGATA 


9000 


GAACATATAG 


TCAAATGACT 


GATGGATTAA 


GAGATCTATG 


GATTAATGTG 


CTATCAAAAT 


9060 


TAGCCTCAAA 


AAATGATGGA 


AGCAATTATG 


ATCTTAATGA 


AGAAATTAAT 


AATATATCGA 


9120 


AAGTTCACAC 


AACCTATAAA 


TCAGATAAAT 


GGTATAATCC 


ATTCAAAACA 


TGGTTTACTA 


9180 


TCAAGTATGA 


TATGAGAAGA 


TTACAAAAAG 


CTCGAAATGA 


GATCACTTTT 


AATGTTGGGA 


9240 


AGGATTATAA 


CTT6TTAGAA 


GACCAGAAGA 


ATTTCTTATT 


GATACATCCA 


GAATTGGTTT 


9300 


TGATATTAGA 


TAAACAAAAC 


TATAATGGTT 


ATCTAATTAC 


TCCTGAATTA 


GTATTGATGT 


9360 


ATTGTGACGT 


AGTCGAAGGC 


CGATGGAATA 


TAAGT6CATG 


TGCTAAGTTA 


GATCCAAAAT 


9420 


TACAATCTAT 


GTATCAGAAA 


GGTAATAACC 


TGTGGGAAGT 


GATAGATAAA 


TTGTTTCC7UV 


9480 


TTATGGGAGA 


AAAGACATTT 


GATGTGATAT 


CGTTATTAGA 


ACCACTTGCA 


TTATCCTTAA 


9540 


TTCAAACTCA 


TGATCCTGTT 


AAACAACTAA 


GAGGAGCTTT 


TTTAAATCAT 


GTGTTATCCG 


9600 


AGATGGAATT 


AATATTTGAA 


TCTAGAGAAT 


CGATTAAGGA 


ATTTCTGAGT 


GTAGATTACA 


9660 


TTGATAAAAT 


TTTAGATATA 


TTTAATAAGT 


CTACAATAGA 


TGAAATAGCA 


GAGATTTTCT 


9720 


CTTTTTTTAG 


AACATTTGGG 


CATCCTCCAT 


TAGAAGCTAG 


TATTGCAGCA 


GAAAAG6TTA 


9780 


GAAAATATAT 


GTATATTGGA 


AAACAATTAA 


AATTTGACAC 


TATTAATAAA 


TGTCATGCTA 


9840 


TCTTCTGTAC 


AATAATAATT 


AACGGATATA 


GAGAGAGGCA 


TGGTGGACAG 


TGGCCTCCTG 


9900 


TGACATTACC 


TGATCATGCA 


CACGAATTCA 


TCATAAATGC 


TTACGGTTCA 


AACTCTGCGA 


9960 


TATCATATGA 


AAATGCTGTT 


GATTATTACC 


AGAGCTTTAT 


AGGTUITAAAA 


TTCAATAAAT 


10020 


TCATAGAGCC 


TCAGTTAGAT 


GAG6ATTTGA 


CAATTTATAT 


GAAAGATAAA 


GCATTATCTC 


10080 


CAAAAAAATC 


AAATTGGGAC 


ACAGTTTATC 


CTGCATCTAA 


TTTACTGTAC 


CGTACTAACG 


10140 


CATCCAACGA 


ATCACGAAGA 


TTAGTTGAAG 


TATTTATAGC 


AGATAGTAAA 


TTTGATCCTC 


10200 


ATCAGATATT 


GGATTAT6TA 


GAATCTGGGG 


ACTGGTTAGA 


TGATCCAGAA 


TTTAATATTT 


10260 


CTTATAGTCT 


TAAAGAAAAA 


GAGATCAAAC 


AGGAAGGTAG 


ACTCTTTGCA 


AAAATGACAT 


10320 


ACAAAATGAG 


AGCTACACAA 


GTTTTATCAG 


AGACCCTACT 


TGCAAATAAC 


ATAGGAAAAT 


10380 


TCTTTCAAGA 


AAATGGGATG 


GTGAAGGGAG 


AGATTGAATT 


ACTTAAGAGA 


TTT^CAACCA 


10440 


TATCAATATC 


AGGAGTTCCA 


CGGTATAATG 


AAGTGTACAA 


TAATTCTAAA 


AGCCATACAG 


10500 


ATGACCTTAA 


AACCTACAAT 


AAAATAAGTA 


ATCTTAATTT 


GTCTTCTAAT 


CAGAAATCAA 


10560 


AGAAATTTGA 


ATTCAA6TCA 


ACGGATATCT 


ACAATGATGG 


ATACGAGACT 


GTGAGCTGTT 


10620 


TCCTAACAAC 


AGATCTCAAA 


ATUITACTGTC 


TTAATTGGAG 


ATATGAATCA 


ACAGCTCTAT 


10680 


TTGGAGAAAC 


TTGCAACCAA 


ATATTTGGAT 


TAAATAAATT 


GTTTAATTGG 


TTACACCCTC 


10740 


GTCTTGAAGG 


AAGTACAATC 


TATGTAGGTG 


ATCCTTACTG 


TCCTCCATCA 


GATAAAGAAC 


10800 
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ATATATCATT 


AGAGGATCAC 


CCTGATTCTG 


GTTTTTACGT 


TCATAACCCA 


AGAGGGGGTA 


10860 


TAGAA66ATT 


TTGTCAAAAA 


TTATGGACAC 


TCATATCTAT 


AAGTGCAATA 


CATCTAGCAG 


10920 


CTGTTAGAAT 


AGGCGTGAGG 


GTGACT6CAA 


TGGTTCAAGG 


AGACAATCAA 


GCTATAGCTG 


10980 


TAACCACAA6 


AGTACCCAAC 


AATTATGACT 


ACAGAGTTAA 


GAAGGAGATA 


GTTTATAAAG 


11040 


ATGTA6TGA6 


ATTTTTTGAT 


TCATTAAGAG 


AAGTGATGGA 


TGATCTAGGT 


CATGAACTTA 


11100 


AATTAAATGA 


AACGATTATA 


AGTAGCAAGA 


TGTTCATATA 


TAGCAAAAGA 


ATCTATTATG 


11160 


ATGGGAGAAT 


TCTTCCTCAA 


GCTCTAAAAG 


CATTATCTAG 


ATGTGTCTTC 


TGGTCAGAGA 


11220 


CAGTAATAGA 


CGAAACAAGA 


TCAGCATCTT 


C/J^TTTGGC 


AACATCATTT 


GCAAAAGCAA 


11280 


TTGAGAATGG 


TTATTCACCT 


GTTCTAGGAT 


ATGCATGCTC 


AATTTTTAAG 


AATATTCAAC 


11340 


AACTATATAT 


TGCCCTTGGG 


ATGAATATCA 


ATCCAACTAT 


AACACAGAAT 


ATCAGAGATC 


11400 


AGTATTTTAG 


GAATCCAAAT 


TGGATGCAAT 


ATGCCTCTTT 


AATACCTGCT 


AGTGTTGGGG 


11460 


GATTCAATCA 


CATGGCGATG 


TCAAGATGTT 


TTGTAAGGAA 


TATTGGTGAT 


CCATCAGTTG 


11520 


CCGCATTGGC 


TGATATTAAA 


AGATTTATTA 


AGGCGAATCT 


ATTAGACCGA 


AGTGTTCTTT 


11580 


ATAGGATTAT 


GAATCAAGAA 


CCAGGTGAGT 


CATCTTTTTT 


TGATTGGGCT 


TCAGATCCAT 


11640 


ATTCATGCAA 


TTTACCACAA 


TCTCAAAATA 


TAACCACCAT 


GATAAAAAAT 


ATAACAGCAA 


11700 


GGAATGTATT 


ACAAGATTCA 


CCAAATCCAT 


TATTATCTGG 


ATTATTCACA 


AATACAATGA 


11760 


TAGAAGAAGA 


TGAAGAATTA 


GCTGAGTTCC 


TGATGGACAG 


GAAGGTAATT 


CTCCCTAGAG 


11820 


TTGCACATGA 


TATTCTAGAT 


AATTCTCTCA 


CAGGAATTAG 


AAATGCCATA 


GCTGGAATGT 


11660 


TAGATACGAC 


AAAATCACTA 


ATTCGGGTTG 


GCATAAATAG 


AGGAGGACTG 


ACATATAGTT 


11940 


TGTTGAGGAA 


AATCAGTAAT 


TACGATCTAG 


TACAATATGA 


AACACTAAGT 


AGGACTTTGC 


12000 


GACTAATTGT 


AAGTGATAAA 


ATCAAGTATG 


AAGATATGTG 


TTCGGTAGAC 


CTTGCCATAG 


12060 


CATTGCXMVCA 


AAAGATGTGG 


ATTCATTTAT 


CAGGAGGAAG 


GATGATAAGT 


GGACTTGAAA 


12120 


CGCCTGACCC 


ATTAGAATTA 


CTATCTGGGG 


TAGTAATMC 


AGGATCAGAA 


CATTGTAAAA 


12180 


TATGTTATTC 


TTCAGATGGC 


ACAAACCCAT 


ATACTTGGAT 


GTATTTACCC 


GGTAATATCA 


12240 


AAATAGGATC 


AGCAGATIACA 


GGTATATCGT 


CATTAAGAGT 


TCCTTATTTT 


GGATCAGTCA 


12300 


CTGATGAAAG 


ATCTGAAGCA 


CAATTAGGAT 


ATATCAAGAA 


TCTTAGTAAA 


CCTGCAAAAG 


12360 


CCGCAATAAG 


AATAGCAATG 


ATATATACAT 


GGGCATTTGG 


TAATGATGAG 


ATATCTTGGA 


12420 


TGGAAGCCTC 


ACAGATAGCA 


CAAACACGTG 


CAAATTTTAC 


ACTAGATAGT 


CTCAAAATTT 


12480 


TAACACCGGT 


AGCTACATCA 


ACAAATTTAT 


CACACAGATT 


AAAGGATACT 


GCAACTCAGA 


12540 


TGAAATTCTC 


CAGTACATCA 


TTGATCAGAG 


TCAGCAGATT 


CATAACAATG 


TCCAATGATA 


12600 


ACATGTCTAT 


CAT^GAAGCT 


AATGAAACCA 


AAGATACTAA 


TCTTATTTAT 


CAACAAATAA 


12660 


TGTTAACAGG 


ATTAAGTGTT 


TTCGAATATT 


TATTTAGATT 


AAAAGAAACC 


ACAGGACACA 


12720 


ACCCTATAGT 


TATGCATCTG 


CACATAGAAG 


ATGAGTGTTG 


TATTAAAGAA 


AGTTTTAATG 


12780 


ATGAACATAT 


TAATCCAGAG 


TCTACATTAG 


AATTAATTCG 


ATATCCTGAA 


AGTAATGAAT 


12840 


TTATTTATGA 


TAAAGACCCA 


CTCAAAGATG 


TGGACTTATC 


AAAACTTATG 


GTTATTAAAG 


12900 
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ACCATTCTTA 


CACAATTGAT 


ATGAATTATT 


GGGATGATAC 


TGACATCATA 


CATGCAATTT 


12960 


CAATATGTAC 


TGCAATTACA 


ATAGCAGATA 


CTATGTCACA 


ATTAGATCGA 


GATAATTTAA 


13020 


AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG 


CTTAATCACT 


GAATTTTTGA 


13080 


CTCTTGACAT 


ACTTGTATTT 


CTCAAGACAT 


TTGGTGGATT 


ATTAGTAAAT 


CAATTTGCAT 


13140 


ACACTCTTTA 


TAGTCTAAAA 


ATAGAAGGTA 


GGGATCTCAT 


TTGGGATTAT 


ATAATGAGAA 


13200 


CACTGAGAGA 


TACTTCCCAT 


TCAATATTAA 


AAGTATTATC 


TAATGCATTA 


TCTCATCCTA 


13260 


AAGTATTCAA 


GAGGTTCTGG 


GATTGTGGAG 


TTTTAAACCC 


TATTTATGGG 


CCTAATATCG 


13320 


CTAGTCAAGA 


CCAGATAAAA 


CTTGCCCTAT 


CTATATGTGA 


ATATTCACTA 


GATCTATTTA 


13380 


TGAGAGAATG 


GTTGAATGGT 


GTATCACTTG 


AAATATACAT 


TTGTGACAGC 


GATATGGAAG 


13440 


TTGCAAATGA 


TAGGAAACAA 


GCCTTTATTT 


CTAGACACCT 


TTCATTTGTT 


TGTTGTTTAG 


13500 


CAGAAATTGC 


ATCTTTCGGA 


CCTAACCTGT 


TA7VACTTAAC 


ATACTTGGAG 


AGACTTGATC 


13560 


TATTGAAACA 


ATATCTTGAA 


TTAAATATTA AAGAAGACCC 


TACTCTTAAA 


TATGTACAAA 


13620 


TATCTGGATT 


ATTAATTAAA 


TCGTTCCCAT 


CAACTGTAAC 


ATACGTAAGA 


AAGACTGCAA 


13680 


TCAAATATCT 


AAGGATTCGC 


GGTATTAGTC 


CACCTGAGGT 


AATTGATGAT 


TGGGATCCGG 


13740 


TAGAAGATGA 


AAATATGCTG 


GATAACATTG 


TCAAAACTAT 


AAATGATAAC 


TGTAATAAAG 


13800 


ATAATAAAGG 


GAATAAAATT AACAATTTCT GGGGACTAGC 


ACTTAAGAAC 


TATCAAGTCC 


13860 


TTAAAATCAG 


ATCTATAACA 


AGTGATTCTG 


ATGATAATGA 


TAGACTAGAT 


GCTAATACAA 


13920 


GTGGTTTGAC 


ACTTCCTCAA 


GGAGGGAATT 


ATCTATCGCA 


TCAATTGAGA 


TTATTCGGAA 


13980 


TCAACAGCAC 


TAGTTGTCTG 


AAAGCTCTTG 


AGTTATCACA AATTTTAATG 


AAGGAAGTCA 


14040 


ATAAAGACAA 


GGACAGGCTC 


TTCCTGGGAG 


AAGGAGCAGG 


AGCTATGCTA 


GCATGTTATG 


14100 


ATGCCACATT 


AGGACCTGCA 


GTTAATTATT 


ATAATTCAGG 


TTTGAATATA 


ACAGATGTAA 


14160 


TTGGTCAACG 


AGAATTGAAA 


ATATTTCCTT 


CAGAGGTATC 


ATTAGTAGGT 


AAAAAATTA6 


14220 


GAAATGTGAC 


ACAGATTCTT 


AACAGGGTAA 


AAGTACTGTT 


CAATGGGAAT 


CCTAATTCAA 


14280 


CATGGATAGG 


AAATATGGAA TGTGAGAGCT 


TAATATGGAG 


TGAATTAAAT 


GATAAGTCCA 


14340 


TTGGATTAGT 


ACATTGTGAT 


ATGGAAGGAG 


CTATCGGTAA 


ATCAGAAGAA 


ACTGTTCTAC 


14400 


ATGAACATTA 


TAGTGTTATA AGAATTACAT 


ACTTGATTGG 


GGATGATGAT 


GTTGTTTTAG 


14460 


TTTCCAAAAT 


TATACCTACA 


ATCACTCCGA 


ATTGGTCTAG 


AATACTTTAT 


CTATATAAAT 


14520 


TATATTGGAA AGATGTAAGT ATAATATCAC 


TCAAAACTTC 


TAATCCTGCA 


TCAACAGAAT 


14580 


TATATCTAAT 


TTCGAAAGAT 


GCATATTGTA 


CTATAATGGA 


ACCTAGTGAA 


ATTGTTTTAT 


14640 


CATUVACTTAA 


AAGATTGTCA 


CTCTTGGAAG 


AAAATAATCT 


ATTAAAATGG 


ATCATTTTAT 


14700 


CAAAGAAGAG 


GAATAATGAA 


TGGTTACATC 


ATGAAATCAA 


AGAAGGAGAA 


AGAGATTATG 


14760 


GAATCATGAG 


ACCATATCAT ATGGCACTAC AAATCTTTGG 


ATTTCAAATC 


AATTTAAATC 


14820 


ATCTGGCGAA 


AG7ATTTTTA 


TCAACCCCAG 


ATCTGACTAA 


TATCAACAAT 


ATAATCCAAA 


14880 


GTTTTCAGCG 


AACAATAAAG 


GATGTTTTAT 


TTGAATGGAT 


TAATATAACT 


CATGATGATA 


14940 


AGAGACATAA ATTAGGCGGA AGATATAACTl TATTCCCACT GAAAAATAAG GGAAAGTTAA 


15000 
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GACTGCTATC 


GAGAAGACTA 


GTATTAAGTT 


G6ATTTCATT 


ATCATTATCG 


ACTC6ATTAC 


15060 


TTACAGGTCG 


CTTTCCTGAT 


GAAAAATTTG 


AACATAGAGC 


ACAGACTGGA 


TATGTATCAT 


15120 


TAGCTGATAC 


TGATTTAGAA 


TCATTAAAGT 


TATTGTCGAA 


AAACATCATT 


AAGAATTACA 


15180 


GAGAGTGTAT 


AGGATCAATA 


TCATATTGGT 


TTCTAACCAA 


AGAAGTTAAA 


ATACTTATGA 


15240 


AATTGATCGG 


TGGTGCTAAA 


TTATTAGGAA 


TTCCCAGACA 


ATATAAAGAA 


CCCGAAGACC 


15300 


AGTTATTAGA 


AAACTACAAT 


CAACATGATG 


AATTTGATAT 


CGATTAAAAC 


ATAAATACAA 


15360 


TGAAGATATA 


TCCTAACCTT 


TATCTTTAAG 


CCTAGGAATA 


GACAAAAAGT 


AAGAAAAACA 


15420 


TGTAATATAT 


ATATACCAAA 


CAGAGTTCTT 


CTCTTGTTTG 


GT 




15462 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TTGTCTGGGA AT 12 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
TTGCCTGGGA AT 

(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
TTGTTTGGGA AT 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 



TTGTCTGGTA AT 



12 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AACTTTAAAT TA 12 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
AACTTAAAAT TA 12 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TTAAAGACAT TG 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
TTTAAGACAT TG 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHTU^VCTERI ST I CS : 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
GCAGATGTCA AG 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
GCAGATGCCA AG 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
CGAATCTT^ GA 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STR7UTOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 
CGAAGCTAAA GA 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 
GAAATATTGA TC 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRJ^EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 
GAAACATTGA TC 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
TCTCTACCCA AC 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
TCGTTAACCA AC 

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 



AGTACAATAG GT 



12 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
AGTACTGTGG GT 12 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 
GCACTTGATC CA 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
ACACTGGATC CA 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 
CCATCATTGT TGTTGACAA 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
CCATCATTGT GGCTGACAA 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 
TTACATGGCC A 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
TCACATGGCG A 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 
TTTGGACTGG 6C 
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(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 
TTTTGATTGG GC 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 
GGTCCTAATA CT 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 
GGGCCTAATA TC 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 
CCATAGAGAG TCCATGGAAA GCGACGCTAA AAACTATC 



wo 98/53078 



PCT/US98/10551 



189 

WHAT IS CLAIMED IS : 

1. An isolated polynucleotide molecule comprising 
an operably linked transcriptional promoter, a polynucleotide 
sequence encoding a PIV genome or antigenome, and a 
transcriptional terminator, wherein said polynucleotide 
sequence encoding said PIV genome or antigenome is modified by 
introduction of a heterologous PIV sequence selected from a 
HPIVl sequence, a HPIV2 sequence, a HPIV3 sequence, a BPIV 
sequence or a MPIV sequence to form a chimeric PIV genome or 
antigenome. 

2 . The isolated polynucleotide molecule of claim 

1, wherein a gene or gene segment of human PIV3 is replaced 
with a counterpart gene or gene segment from a heterologous 
PIV. 

3. The isolated polynucleotide molecule of claim 

2, wherein the counterpart gene or gene segment is a HN or F 
glycoprotein gene or gene segment of HPIVl or HPIV2. 

4 . The isolated polynucleotide molecule of claim 
2, wherein an HN or F glycoprotein gene of PIVl or PIV2 is 
substituted for the counterpart HN or F glycoprotein gene of 
HPIV3 . 

5. The isolated polynucleotide molecule of claim 
1, wherein the polynucleotide sequence encoding the genome or 
antigenome incorporates a BPIV gene or gene segment. 

6. The isolated polynucleotide molecule of claim 
1, which incorporates a heterologous sequence from RSV. 

7. The isolated polynucleotide molecule of claim 
6, wherein the heterologous sequence from RSV is a G or F gene 
or gene segment . 
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8. The isolated polynucleotide molecule of claim 

I, which incorporates a heterologous sequence from measles 
virus . 

9. The isolated polynucleotide molecule of claim 
8, wherein the heterologous sequence from measles virus is a 
HA or F gene or gene segment . 

10. An isolated polynucleotide molecule comprising 
an operably linked transcriptional promoter, a polynucleotide 
sequence encoding a PIV genome or antigenome, and a 
transcriptional terminator, wherein said polynucleotide 
sequence encoding said PIV genome or antigenome is selected 
from the group consisting of: 

i) p218(131) (SEQ ID NO: 1) ; 

ii) p3/7(131) (SEQ ID NO: 14); 

iii) p3/7(131)2G (SEQ ID NO: 15); or 

iv) the isolated polynucleotide of i) , ii) or iii) 
modified by introduction of a heterologous PIV sequence 
selected from a HPIVl sequence, a HPIV2 sequence, a BPIV 
sequence or a MPIV sequence or by a nucleotide insertion, 
rearrangement, deletit)n or substitution specifying a 
phenotypic alteration selected from attenuation, temperature- 
sensitivity, cold-adaptation, small plaque size, host range 
restriction, or a change in an immunogenic epitope of PIV. 

11. An isolated polynucleotide molecule comprising 
an operably linked transcriptional promoter, a polynucleotide 
sequence encoding a PIV genome or antigenome, and a 
transcriptional terminator, wherein said polynucleotide 
sequence encoding said PIV genome or antigenome is modified by 
a nucleotide insertion, rearrangement, deletion or 
substitution. 

12. The isolated polynucleotide molecule of claim 

II, wherein said nucleotide insertion, rearrangement, deletion 
or substitution specifies a phenotypic alteration selected 
from attenuation, temperature-sensitivity, cold- adaptation, 
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small plaque size, host range restriction, or a change in an 
immunogenic epitope of PIV, 

13. The isolated polynucleotide molecule of claim 
12, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates multiple ts mutations. 

14 . The isolated polynucleotide molecule of claim 
12, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates multiple non-ts attenuating 
mutations . 

15. The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates one or more mutations of JS 
cp45 • 

16. The isolated polynucleotide molecule of claim 

15, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome encodes at least one amino acid 
substitution in the polymerase L protein. 

17. The isolated polynucleotide molecule of claim 

16, wherein the amino acid substitution in the polymerase L 
protein occurs at a position corresponding to Tyr942, LeUggj/ or 
Thr^ssa of JS cp45. 

18. The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome encodes at least one amino acid 
substitution in the N protein. 

19. The isolated polynucleotide molecule of claim 
28, wherein the amino acid substitution in the N protein 
occurs at a position corresponding to residues Val^^ or Ser^ss 
of JS cp45. 
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20. The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome encodes an amino acid substitution in the 
C protein. 

21. The isolated polynucleotide molecule of claim 
20, wherein the amino acid substitution in the C protein 
occurs at a position corresponding to Ilegg of JS cp45. 

22. The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome encodes at least one amino acid 
substitution in the F protein. 

23. The isolated polynucleotide molecule of claim 
22, wherein the amino acid substitution in the F protein 
occurs at a position corresponding to Ile42o or Ala45o of JS 
cp45 . 

24 . The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome encodes an amino acid substitution in the 
HN protein. 

25. The isolated polynucleotide molecule of claim 
24, wherein the amino acid substitution in the HN protein 
occurs at a position corresponding to residue Valj^^ of JS 
cp45 • 

26. The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates at least one mutation in a 
3' leader sequence. 

27. The isolated polynucleotide molecule of claim 
26, wherein the mutation in the 3' leader occurs at a position 
corresponding to nucleotide 23, 24, 28, or 45 of JS cp45. 
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28. The isolated polynucleotide molecule of claim 

11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates a mutation in a N gene start 
sequence . 

29. The isolated polynucleotide molecule of claim 
28, wherein the mutation in the N gene start sequence occurs 
at a position corresponding to nucleotide 62 of JS cp45. 

30. The isolated polynucleotide molecule of claim 

12, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates a plurality and up to a full 
complement of mutations present in rcp45, rcp45 3'NCMFHN, 
rcp45 3'NL, rcp45 3'N, or rcp45 F. 

31. The isolated polynucleotide molecule of claim 
12, which is an antigenomic cDNA selected from rcp45, rcp45 
3'NCMFHN, rcp45 3'NL, rcp45 3'N, rcp45 L, rcp45 F, rcp45 M, 
rcp45 HN, or rcp45 C. 

32. The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates a mutation stabilized by 
multiple nucleotide substitutions in a codon specifying the 
mutation. 

33 . The isolated polynucleotide molecule of claim 
11, wherein said polynucleotide sequence encoding said PIV 
genome or antigenome incorporates a heterologous sequence from 
HPIVl, HPIV2, HPIV3, BPIV or MPIV to form a chimeric genome or 
antigenome. 

34. The isolated polynucleotide molecule of claim 
33, wherein said chimeric genome or antigenome incorporates 
one or more ts mutations. 
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35. The isolated polynucleotide molecule of claim 
33, wherein said chimeric genome or antigenome incorporates 
one or more non-ts attenuating mutations. 

36. The isolated polynucleotide molecule of claim 
5 33, wherein said chimeric genome or antigenome incorporates 

one or more mutations of JS cp45. 

37. The isolated polynucleotide molecule of claim 
36, wherein said one or more mutations of JS cp45 occur in one 
or more PIV proteins selected from L, M, N, C, F, or HN or in 

10 a PIV extragenic sequence selected from a 3' leader or N-gene 
start sequence. 

38. The isolated polynucleotide molecule of claim 
33, wherein said chimeric genome or antigenome incorporates 
multiple mutations each specifying a phenotype selected from 

15 attenuation, temperature-sensitivity, cold-adaptation, small 
plaque size, or host range restriction. 

39. The isolated polynucleotide molecule of claim 
33, wherein said chimeric genome or antigenome incorporates at 
least one and up to a full complement of mutations present in 

20 rcp45, rcp45 3'NCMFHN, rcp45 3'NL, rcp45 3'N, or rcp45 F, 

40. The isolated polynucleotide molecule of claim 
33, wherein a mutation specifying a phenotypic alteration 
selected from attenuation, temperature-sensitivity, cold- 
adaptation, small plaque size, host range restriction, or a 

25 change in an immunogenic epitope of PIV is incorporated in a 
chimeric PIV background comprising a genome or antigenome 
having one or more PIV3 HN or F glycoprotein genes substituted 
by one or more counterpart PIVl or PIV2 HN and F glycoprotein 
genes . 



30 



41. The isolated polynucleotide molecule of claim 
33, wherein the heterologous sequence specifies a phenotypic 
alteration selected from attenuation, temperature-sensitivity. 
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cold-adaptation, small plaque size, host range restriction, or 
a change in an immunogenic epitope of a chimeric PIV. 

42. The isolated polynucleotide molecule of claim 
11, which incorporates a cis-acting regulatory sequence of 

5 HPIVl, HPIV2, BPIV or MPIV. 

43 . The isolated polynucleotide molecule of claim 
11, which incorporates a heterologous sequence from RSV, 

44 . The isolated polynucleotide molecule of claim 
43, wherein the heterologous sequence from RSV is a G or F 

10 gene or gene segment. 

45. The isolated polynucleotide molecule of claim 
11, which incorporates a heterologous sequence from measles 
virus. 



46. The isolated polynucleotide molecule of claim 
15 45, wherein the heterologous sequence from measles virus is a 

HA or F gene or gene segment. 

47. The isolated polynucleotide molecule of claim 
11, which incorporates a polynucleotide sequence encoding a 
non-PIV molecule selected from a cytokine, a T-helper epitope, 

20 a restriction site marker, or a protein of a microbial 

pathogen capable of eliciting a protective immune response in 
a mammalian host. 



48. A cell or cell-free composition including an 
expression vector which comprises an isolated polynucleotide 
25 molecule encoding a PIV genome or antigenome and an expression 
vector which comprises one or more isolated polynucleotide 
molecules that encode (s) N, P and L proteins of PIV, whereby 
expression of said PIV genome or antigenome and N, P, and L 
proteins yields an infectious PIV particle. 
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49. The cell or cell -free composition of claim 48, 
wherein the infectious PIV particle is a virus. 

50. The cell or cell-free composition of claim 48, 
wherein the infectious PIV particle is a subviral particle. 

5 51. The cell or cell-free composition of claim 48, 

wherein the polynucleotide encoding the PIV genome or 
antigenome and the one or more polynucleotides encoding N, P 
and L proteins of PIV are incorporated within a single vector. 

52. A method for producing an infectious PIV 

10 particle from one or more isolated polynucleotide molecules 

encoding said PIV, comprising: 

coexpressing in a cell or cell-free system an 

expression vector which comprises a polynucleotide molecule 

encoding a PIV genome or antigenome and an expression vector 
15 which comprises one or more polynucleotide molecules encoding 

N, P and L proteins, thereby producing an infectious PIV 

particle. 

53. The method of claim 52, wherein the PIV genome 
or antigenome and the N, P, and L proteins are expressed by 

20 the same expression vector. 

54. The method of claim 52, wherein the N, P, and L 
proteins are encoded on two or three different expression 
vectors . 

55. The method of claim 52, wherein at least one of 
25 the N, P and L proteins is supplied by coinfection with PIV. 

56. The method of claim 52, wherein the 
polynucleotide molecule that encodes the PIV genome or 
antigenome is cDNA. 



57. The method of claim 52, wherein the infectious 
30 PIV particle is a virus. 
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58. The method of claim 52, wherein the infectious 
PIV particle is a subviral particle. 

59. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 

5 is a human, bovine or murine PIV sequence. 

60. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
encodes the sequence of a wild- type PIV strain. 

61. The method of claim 52, wherein the 

10 polynucleotide molecule encoding the PIV genome or antigenome 
encodes HPIV3 . 

62. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
incorporates an attenuating mutation from a biologically 

15 derived PIV strain. 

63. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
incorporates one or more ts mutations. 

64. The method of claim 52, wherein the 

20 polynucleotide molecule encoding the PIV genome or antigenome 
incorporates one or more non-ts attenuating mutations. 

65. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
incorporates at least one mutation of JS cp45. 

25 66. The method of claim 65, wherein the 

polynucleotide molecule encoding the PIV genome or antigenome 
incorporates multiple mutations of JS cp45. 
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67. The method of claim 65, wherein the mutation of 
JS cp45 specifies at least one amino acid substitution in the 
polymerase L protein. 

68. The method of claim 67, wherein the amino acid 
5 substitution in the polymerase L occurs at a position 

corresponding to Tyrg^a, Leuggji or Thrig^g of JS cp45. 

69. The method of claim 65, wherein said mutation 
of JS cp45 specifies a change in a PIV protein selected from 
L, M, N, C, F, or HN or in a PIV extragenic sequence selected 

10 from a 3' leader or N gene start sequence. 

70. The method of claim 52, wherein said 
polynucleotide molecule encoding the PIV genome or antigenome 
incorporates a mutation that is stabilized by multiple 
nucleotide substitutions in a codon which specifies the 

15 mutation. 

71. The method of claim 52, wherein said 
polynucleotide molecule encoding said PIV genome or antigenome 
incorporates a plurality and up to a full complement of 
mutations present in rcp45, rcp45 3'NCMFHN, rcp45 3'NL, rcp45 

20 3'N, or rcp45 F. 

72. The method of claim 69, wherein said 
polynucleotide molecule encoding said PIV genome or antigenome 
is an antigenomic cDNA selected from rcp45, rcp45 3'NCMFHN, 
rcp45 3'NL, rcp45 3'N, rcp45 L, rcp45 F, rcp45 M, rcp45 HN, or 

25 rcp45 C. 

73. The method of claim 52, wherein said 
polynucleotide molecule encoding said PIV genome or antigenome 
incorporates a heterologous sequence from HPIVl, HPIV2, HPIV3, 
BPIV or MPIV to form a chimeric genome or antigenome. 



74 . The method of claim 73 , wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
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is a chimera of a HPIV3 sequence and a HPIVl, HPIV2, BPIV or 
MPIV sequence. 

75. The method of claim 74, wherein a heterologous 
sequence from HPIVl or HPIV2 encoding a gene or gene segment 

5 of an HN or F glycoprotein is substituted for a corresponding 
gene or gene segment of HPIV3 . 

76. The method of claim 73, wherein said chimeric 
genome or antigenome incorporates one or more ts mutations. 

77. The method of claim 73, wherein said chimeric 
10 genome or antigenome incorporates one or more non-ts 

attenuating mutations. 

78. The method of claim 73, wherein said chimeric 
genome or antigenome incorporates one or more mutations of JS 
cp45, 

15 79. The method of claim 73, wherein said chimeric 

genome or antigenome incorporates multiple mutations each 
specifying a phenotype selected from attenuation, temperature- 
sensitivity, cold-adaptation, small plaque size, or host range 
restriction. 

20 80. The method of claim 73, wherein a mutation 

specifying a phenotypic alteration selected from attenuation, 
temperature-sensitivity, cold- adaptation, small plaque size, 
host range restriction, or a change in an immunogenic epitope 
of PIV is incorporated in a chimeric PIV background comprising 

25 a genome or antigenome having one or more PIV3 HN or F 

glycoprotein genes substituted by one or more counterpart PIVl 
or PIV2 HN and F glycoprotein genes. 

81. The method of claim 80, wherein one or more 
mutations of JS cp45 are incorporated in a chimeric background 
30 comprising a genome or antigenome having both PIV3 HN and F 
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glycoprotein genes substituted by counterpart PIVl or PIV2 HN 
and F glycoprotein genes. 

82. The method of claim 81, wherein said one or 
more mutations of JS cp45 occur in one or more PIV proteins 

5 selected from L, M, C, F, or HN or in a PIV extragenic 

sequence selected from a 3' leader or N gene start sequence • 

83. The method of claim 12, wherein the 
heterologous sequence specifies a phenotypic alteration 
selected from attenuation, temperature-sensitivity, cold- 

10 adaptation, small plaque size, host range restriction, or a 
change in an immunogenic epitope of a chimeric PIV. 

84. The method of claim 52, wherein said 
polynucleotide molecule encoding said PIV genome or antigenome 
incorporates a heterologous sequence from RSV. 

15 85. The method of claim 84, wherein the 

heterologous sequence from RSV is a G or F gene or gene 
segment - 

86. The method of claim 52, wherein said 
polynucleotide molecule encoding said PIV genome or antigenome 

20 incorporates a heterologous sequence from measles virus. 

87. The method of claim 86, wherein the 
heterologous sequence from measles virus is a HA or F gene or 
gene segment. 



25 



88. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
is selected from: 



30 



i) p218(131) (SEQ ID NO: 1) ; 

ii) p3/7(131) (SEQ ID NO: 14); 

iii) p3/7(131)2G (SEQ ID NO: 15); or 

iv) the polynucleotide molecule of i) , ii) or iii) 



modified by introduction of a heterologous PIV sequence 
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selected from a HPIVl sequence, a HPIV2 sequence, a BPIV 
sequence or a MPIV sequence or by a nucleotide insertion, 
rearrangement, deletion or substitution specifying a 
phenotypic alteration selected from attenuation, temperature- 
5 sensitivity, cold- adaptation, small plaque size, host range 
restriction, or a change in an immunogenic epitope of PIV. 



89. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
is selected from: 
10 i) p218(131) (SEQ ID NO: 1) ; 

ii) p3/7{131) (SEQ ID NO: 14); 

iii) p3/7 (131)20 (SEQ ID NO: 15); or 

iv) the polynucleotide molecule of i) , ii) or iii) 
modified by introduction of a heterologous PIV sequence 

15 selected from a HPIVl sequence, a HPIV2 sequence, a BPIV 
sequence or a MPIV sequence and by a nucleotide insertion, 
rearrangement, deletion or substitution different from said 
introduction of said heterologous PIV sequence specifying a 
phenotypic alteration selected from attenuation, temperature- 

20 sensitivity, cold- adaptation, small plaque size, host range 
restriction, or a change in an immunogenic epitope of PIV. 



90. The method of claim 52, wherein the 
polynucleotide molecule encoding the PIV genome or antigenome 
is modified to encode a non-PIV molecule selected from a 

25 cytokine, a T-helper epitope, a restriction site marker, or a 
protein of a microbial pathogen capable of eliciting a 
protective immune response in a mammalian host. 

91. An isolated infectious PIV particle which 
comprises a recombinant PIV genome or antigenome, a N protein, 

30 a P protein, and a L protein. 

92. The isolated infectious PIV particle of claim 
91, which is a subviral particle. 
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93. The isolated infectious PIV particle of claim 
91, which is a virus. 

94. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome 

5 incorporates a heterologous sequence from RSV or measles 



95. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome is a 
cDNA. 



91, which is a human PIV. 

97. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome is a 
chimera of heterologous PIV sequences selected from HPIVl, 

15 HPIV2, HPIV3, BPIV, or MPIV sequences. 

98. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome is 
selected from: 

i) p218(131) (SEQ ID NO: 1) ; 
20 ii) p3/7(131) (SEQ ID NO: 14); 

iii) p3/7(131)2G (SEQ ID NO: 15); or 

iv) the genome or antigenome of i) , ii) or iii) 
modified by introduction of a heterologous PIV sequence 
selected from a HPIVl sequence, a HPIV2 sequence, a BPIV 

25 sequence or a MPIV sequence or by a nucleotide insertion, 
rearrangement, deletion or substitution specifying a 
phenotypic alteration selected from attenuation, temperature - 
sensitivity, cold-adaptation, small plaque size, host range 
restriction, or a change in an immunogenic epitope of PIV. 



virus . 



10 



96. 



The isolated infectious PIV particle of claim 



30 



99. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome is 
selected from: 



wo 98/53078 PCTAJS98/10551 

203 

i) p218(131) (SEQ ID NO: 1) ; 

ii) p3/7(131) (SEQ ID NO: 14); 

iii) p3/7(131)2G (SEQ ID NO: 15); or 

iv) the genome or antigenome of i) , ii) or iii) 
5 modified by introduction of a heterologous PIV sequence 

selected from a HPIVl sequence, a HPIV2 sequence, a BPIV 
sequence or a MPIV sequence and by a nucleotide insertion, 
rearrangement, deletion or substitution different from said 
introduction of said heterologous PIV sequence specifying a 
10 phenotypic alteration selected from attenuation, temperature- 
sensitivity, cold-adaptation, small plaque size, host range 
restriction, or a change in an immunogenic epitope of PIV. 

100. The isolated infectious PIV particle of claim 
91, wherein the counterpart gene or gene segment is a gene or 
15 gene segment of the HN or F glycoprotein gene of HPIVl or 
HPIV2. 



101. The isolated 
91, wherein the recombinant 
incorporates a heterologous 
20 virus. 



infectious PIV particle of claim 
PIV genome or antigenome 
sequence from RSV or measles 



102. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome is 
modified by a nucleotide insertion, rearrangement, deletion or 
substitution encoding a phenotypic alteration selected from 

25 attenuation, temperature-sensitivity, cold- adaptation, small 
plaque size, host range restriction, or a change in an 
immunogenic epitope of PIV. 

103 . The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome 

30 incorporates multiple ts mutations. 



104. The isolated infectious PIV particle of 
91, wherein the recombinant PIV genome or antigenome 
incorporates multiple non-ts attenuating mutations. 



claim 
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105. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome 
incorporates at least one mutation of JS cp45. 

106. The isolated infectious PIV particle of claim 
105, wherein the mutation of JS cp45 specifies an amino acid 
substitution in the polymerase L protein. 

107. The isolated infectious PIV particle of claim 
91, wherein said chimeric genome or antigenome incorporates 
one or more ts mutations. 

108. The isolated infectious PIV particle of claim 
97, wherein said chimeric genome or antigenome incorporates 
one or more non-ts attenuating mutations. 

109. The isolated infectious PIV particle of claim 
97, wherein said chimeric genome or antigenome incorporates 
one or more mutations of JS cp45. 

110. The isolated infectious PIV particle of claim 
129, wherein said chimeric genome or antigenome incorporates 
multiple mutations each specifying a phenotype selected from 
attenuation, temperature-sensitivity, cold-adaptation, small 
plaque size, or host range restriction. 

111. The isolated infectious PIV particle of claim 
109, wherein said chimeric genome or antigenome incorporates 
at least one and up to a full complement of mutations present 
in rcp45, rcp45 3'NCMFHN, rcp45 3'NL, rcp45 3'N, or rcp45 F. 

112. The isolated infectious PIV particle of claim 
91, wherein a mutation specifying a phenotypic alteration 
selected from attenuation, temperature-sensitivity, cold- 
adaptation, small plaque size, host range restriction, or a 
change in an immunogenic epitope of PIV is incorporated in a 
chimeric PIV background comprising a genome or antigenome 
having one or more PIV3 HN or F glycoprotein genes or gene 
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segments substituted by one or more counterpart PIVl or PIV2 
HN and P glycoprotein genes or gene segments. 

113. The isolated infectious PIV particle of claim 
112, wherein one or more mutations of JS cp45 are incorporated 

5 in said chimeric background. 

114 . The isolated infectious PIV particle of claim 
of claim 113, wherein said one or more mutations of JS cp45 
occur in one or more PIV proteins selected from L, M, F, 
or HN or in a PIV extragenic sequence selected from a 3' 
leader or N gene start sequence. 

115. The isolated infectious PIV particle of claim 
91, wherein the recombinant PIV genome or antigenome is 
modified to encode a non-PIV molecule selected from a 
cytokine, a T-helper epitope, a restriction site marker, or a 
protein of a microbial pathogen capable of eliciting a 
protective immune response in a mammalian host. 

116. The isolated infectious PIV particle of claim 
91, further comprising an RSV antigen or epitope which elicits 
protective immunity to RSV in an immunized host. 

20 117. The isolated infectious PIV particle of claim 

91, which is selected from r942, r992, rl558, r942/992, 
r992/l558, r942/1558, or r942/992/l558 , rcp45 3"N, rcp45 C, 
rcp45 M, rcp45 F, rcp45 HN, rcp45L, rcp45 3"NL, rcp45 
3-'NCMFHN, and rcp45. 

25 118. An immunogenic composition comprising an 

immunogenically effective amount of an infectious PIV particle 
in a pharmaceutically acceptable carrier, said PIV particle 
comprising a recombinant PIV genome or antigenome, a N 
protein, a P protein, and a L protein. 

30 119. The immunogenic composition of claim 118, 

wherein said infectious PIV particle is a subviral particle. 



10 



15 
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120. The immunogenic composition of claim 118, 
wherein said infectious PIV particle is a virus , 

121. The immunogenic composition of claim 118, 
wherein the recombinant PIV genome or antigenome incorporates 
a heterologous sequence from RSV or measles virus. 

122. The immunogenic composition of claim 118, 
wherein the recombinant PIV genome or antigenome is a chimera 
of heterologous PIV sequences selected from HPIVl, HPIV2, 
HPIV3, BPIV, or MPIV sequences to form an infectious, chimeric 
PIV particle. 

123. The immunogenic composition of claim 118, 
wherein the recombinant PIV genome or antigenome encodes a 
human PIV in which a gene or gene segment is replaced with a 
counterpart gene or gene segment from a heterologous PIV, 

124. The immunogenic composition of claim 123, 
wherein one or both HN and F glycoprotein genes of HPIVl are 
substituted for HN and F glycoprotein genes of HPIV3 to form 
said infectious, chimeric PIV particle. 

125. The immunogenic composition of claim 123, 
wherein the recombinant PIV genome or antigenome of said 
infectious, chimeric PIV particle is modified by a nucleotide 
insertion, rearrangement, deletion or substitution encoding a 
phenotypic alteration selected from attenuation, temperature- 
sensitivity, cold-adaptation, small plaque size, host range 
restriction, or a change in an immunogenic epitope of PIV. 

126. The immunogenic composition of claim 125, 
wherein said recombinant PIV genome or antigenome incorporates 
multiple mutations selected from ts and non- ts attenuating 
mutations to form an attenuated, infectious, chimeric PIV 
particle. 
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127. The immunogenic composition of claim 118, 
wherein the recombinant PIV genome or antigenome incorporates 
a mutation of JS cp45. 

128. The immunogenic composition of claim 118, 
wherein the recombinant PIV genome or antigenome incorporates 
multiple mutations of JS cp45. 
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RHABDOVIRUSES WITH REENGINEERED COATS 



Related Application Data 

This application claims priority benefit of co-pending U.S. application 
serial number 60/052.366, filed 11 July 1997. 



Technical Field of the Invention 

5 This invention relates primarily to replication-competent rhabdovinises 

which lack a functional glycoprotein gene and express in the viral coat at least one 
receptor for another virus. Recombinant rhabdoviruses of the invention such as 
vesicular stomatitis virus which express at least one cellular receptor and a 
coreceptor of another pathogenic virus such as HIV are useful for the treatment of 

10 patients infected with the pathogenic virus. 



Background of the Invention 

To cause infection, membrane-enveloped viruses such as human inunu- 
nodeficiency virus type 1 (HIV-1) must first bind receptors at the cell surface and 
subsequently fuse their own membrane with that of the cell. This fusion process 
15 releases the viral genetic material mto the cytoplasm and initiates mfection. Entry 
of HIV-1 into cells has long been known to require vuiis binding to the cell 
surface molecule CD4 (Maddon, et a/., 1986; Sattentau and Weiss, 1988), and 
recent studies have defined chemokine receptor molecules, CXCR4 (Feng, et aL, 
1996; Berson, etaL, 1996), CCR5 (Alkhatib, etaL, 1996; Choe, etaL, 1996; 
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Deng, et a/., 1996; Doranz, et al, 1996; Dragic, et al., 1996), and CCR3 
(Choe, et al,, 1996; Doranz, et al., 1996) as co-receptors required for entry. 
HIV entry occurs in stages: initial binding of the viral envelope protein (designat- 
ed gpl20/41) to CD4 is followed by conformational changes that allow binding to 
coreceptor and subsequent coalescence of the viral and cellular membranes 
(Lapham, et aL, 1996; Trkola, et aL, 1996; Wu. et aL, 1996). HIV-infected 
cells express gp 120/41 on the cell surface during infection and prior to its incorpo- 
ration into budding HIV-1 particles. 

Recent studies have shown that vescisular stomatitis virus (VSV) has 
potential as a high level expression vector capable of incorporating foreign 
proteins into the viral envelope (Schnell, et aL, 1996a; Schnell, et aL, 1996b). 
VSV causes an extremely rapid cytopathic infection in most animal cells including 
human T cells in culture, but is normally non-pathogenic in humans (for review 
see Wagner and Rose, 1996). Within two to three hours after infection, VSV 
blocks host cell protein synthesis and within eight hours produces very large 
quantities of progeny virus particles that bud from the cell surface prior to cell 
lysis. VSV has a non-segmented, negative-strand RNA genome that is copied in 
the cytoplasm of infected cells by the viral RNA polymerase to generate five 
mRNAs encoding the five structural proteins. Only one of these proteins, a 
glycoprotein designated G, is present in the viral membrane and is responsible for 
the very broad host range of VSV. G protein recognizes cell surfaces and 
catalyzes fusion of the viral membrane with cellular membranes (Florkiewicz and 
Rose, 1984). Foreign membrane proteins such as CD4 and other viral membrane 
proteins can be expressed at very high levels from the genome of recombinant 
VSVs and these molecules are then incorporated at high levels into the viral 
membrane along with G protein (Schnell, et aL, 1996b). 
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Summary of the Invention 

It is an objective of the invention to utihze this strategy to engineer 
recombinant viruses that can be employed to target cells infected with another 
virus such as HIV. 

5 It is a further and more specific objective to provide a method for 

treating a person infected with HIV. 

These and other objectives are accomplished by the present invention, 
which provides replication-competent recombinant rhabdoviruses such as vesicular 
stomatitis viruses (VSV) that lack a functional glycoprotein gene and express in 

10 their viral coats at least one foreign polypeptide. In preferred embodiments, the 
foreign polypeptide is a cellular receptor for another virus and is expressed in a 
manner sufficient to target cells infected with the other virus. In one embodiment, 
the invention provides replication-competent recombinant vesicular stomatitis virus 
which lacks a functional G glycoprotein gene and displays at least one cellular 

15 receptor for a pathogenic mammalian virus. Where the other virus is HIV, the 
expressed polypeptide is an HIV receptor such as CD4, an HIV coreceptor such as 
CXCR4, and/or mixtures of these. 

The invention correspondingly provides methods for treating patients 
infected with a pathogenic virus by administering to the patient an effective 
20 amount of a recombinant rhabdovirus of the invention such as VSV which express- 
es in its viral envelope at least one receptor for the pathogenic virus. As set out 
hereafter » treatment methods for patients mfected with HIV-l and/or HIV-2 are 
disclosed. 
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Description of the Figures 

Figure 1 illustrates diagrams of recombinant VSV genomes. The order 
of the genes in wild-type VSV and in the recombinant VSVs described is shown. 
Each extra gene was inserted so that it contained the appropriate signals specifying 
5 VSV polymerase transcription initiation and termination. The details of plasmid 
constructions and derivation of the recombinant viruses are given in the Examples. 
The genes are transcribed from left to right from the negative strand RNA genome 
which is shown in the 3'-5' orientation. 

Figure 2 shows photographs of cultures illustratingthat VSVs lacking 
10 the G gene form plaques on a complementing cell line. BHK-G cells were seeded 
on 6-well plates (1.5 x 10^ cells in each well), and G protein expression was 
induced by removal of tetracycline. The indicated dilutions of 100 fil the VSVAG- 
CD4 virus (10^ IC' and 10^> were added 16 h later, followed by addition of 2 ml 
of DME containing 2% methylcellulose. Cells were incubated for 48 h and then 
15 stained with napthalene black to visualize the small plaques. Wild-type VSV 
plaques are much larger and are visible after 18h, but plaque formation by the 
defective virus on this cell line was slower, probably because of the reduced level 
of complementing VSV G protein expressed. 

Figure 3 shows immunoelectron microscopic visualization of CD4 in the 
20 envelopes of VSVAG-CD4 and VSVAG-CC4. Purified wild-type VSV, VSVAG- 
CD4, or VSVAG-CC4 particles were negatively stained with phosphotungstic acid 
after labeling with a monoclonal antibody directed to CD4 followed by gold-conju- 
gate goat anti-mouse IgG particles. Wild-type VSV particles contained visible 
spikes of VSV G protein trimers (arrows) and did not bind antibody to CD4 (wild- 
25 type VSV, left panel), while VSVAG-CD4 or VSVAG-CC4 bound anti-CD4 
antibody and did not show visible spikes on the surface. 
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Figure 4 illustrates analysis of HIV-1 and VSVAG-CC4 infection by 
immunofluorescence microscopy. Duplicate cultures of Jurkat cells were infected 
with HIV-1 strain IIIB and five days later, superinfected with VSVAG-CC4 (panel 
D, E, F) or propagated without superinfection (panel A, B, C). Jurkat cells that 
5 were uninfected with HIV-l were also infected with VSVAG-CC4 (panel G, H, I). 
Fourteen days after the VSVAG-CC4 infection, cells were fixed and labeled 
with a mouse monoclonal antibody to VSV N protein and a secondary, affinity- 
purified, rhodamine-conjugated anti-mouse antibody (detected in panels C, F, I) 
and with a human polyclonal anti-HIV immune globulin followed by affinity- 
10 purified, FlTC-conjugated anti-human antibodies (detected in panels B, E, H). 
Panels A, D, and G show Nomarski images of the same fields of cells shown 
stained in the two images to their right. 

Figure 5 provides line graphs illustrating quantitation of HIV-1 and 
VSVAG-CC4 infection in Jurkat cells. Duplicate cultures of Jurkat cells were 

15 infected with HIV-1 IIIB at day 0 and either superinfected with VSVA-CC4 five 
days later (■) or propagated without superinfection (□). The percentage of 
HIV* cells was determined by immunofluorescence microscopy on the days 
indicated (Figure 5A). The percentage of HIV"^ cells that were also infected with 
VSVAG-CC4 was determined by inununofluorescence microscopy to detect VSV 

20 N protein (Figure 5B). One hundred to two himdred cells were counted from 
multiple random fields to determine the percentages of cells staining positive for 
HIV or VSV antigens. 

Figure 6 provides line graphs illustrating the effect of VSVAG-CC4 
infection on production of infectious HIV-1 and on release of RT. Media from 
25 Jurkat cell cultures were analyzed for infectious HIV titers and for RT activity as 
described in the Examples. Titers (Figure 5 A) were performed in triplicate and 
are reported as TCIDjo/ml (□, HIV infected only; ■, HIV + VSVAG-CC4). 
Titers determined by this assay were consistent with titers determined by an 
indirect immunofluorescence assay detecting HIV antigens. RT activity (Figure 
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5B) is reported in total counts per minute (cpm) bound to the DEAE paper after 
subtraction of a backgrounds of less than 50 cpm (□, HIV infected only; ■, 

HIV -f VSVAG-CC4; A, VSVAG-CC4 infected only). The numbers plotted are 

the average of duplicates that agreed within 5%. Samples were collected from 
5 Jurkat cultures on the indicated days after superinfection with VSVAG-CC4. 

Detailed Description of the Invention 

This invention is based upon the construction of a new virus from 
vesicular stomatitis virus (VSV) that is replication-competent and kills cells 
rapidly, but is only able to infect cells that are first infected with HIV. The virus 
10 contains a deletion in the VSV envelope glycoprotein gene and expresses instead 
the genes for the HIV receptor CD4 and an HIV coreceptor, CXCR4. The virus 
kills HIV-infected cells in culture and controls HIV infection. 

In the practice of this invention, replication-competent recombinant 
rhabdovinises are generated which lack a functional glycoprotein gene and express 

15 a foreign polypeptide in their viral envelopes. By "foreign" is meant not naturally 
occurring in the rhabdovirus transformed. The foreign polypeptide is one that 
binds to a mammalian cell such as a cellular receptor and/or coreceptor for 
another virus in many embodiments, and an antibody in other embodiments. The 
invention is particularly suited to target cells mfected with membrame-enveloped 

20 viruses. The invention provides a general strategy of virus targeting to infected 
cells in cases where receptor(s) recognized by vural envelope proteins are known, 
and the viral protein has membrane fusion activity. Likewise, both conventional 
and hybrid antibodies to pathologic cells are also known and are used to target 
these cells. In either case, foreign polypeptides are expressed in the rhabdovirus 

25 in a manner sufficient to target cells to be killed. Therefore, as used herein, by a 
"recombinant rhabdovirus lacking a functional glycoprotein gene" is meant a virus 
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having any alteration or disruption of the glycoprotein gene, and/or expressing a 
poorly functional or nonfunctional glycoprotein, or combinations thereof. 

In the examples that follow, recombinant VSVs are generated which lack 
a functional envelope glycoprotein G or corresponding gene and express instead at 
5 least one receptor or coreceptor of an HIV virus. In preferred embodiments, the 
G gene is deleted, but any mutation of the gene that alters the host range specifici- 
ty of VSV or otherwise eliminates the function of the G protein can be employed. 

A gene for a foreign polypeptide such as one that binds to manunalian 
cells mfected with a pathogenic virus is then inserted and/or added to the genome 

10 of the recombinant rhabdovirus. As sununarized above, in many typical embodi- 
ments, the added gene is a receptor recognized by a virus glycoprotein and that is 
present on the surface of mammalian cells. However, recombinant rhabdoviruses 
engineered to kill other cells, e.g., cancer cells, are encompassed by the invention. 
In these embodiments, genes for specific antibodies and a membrane fusion protein 

15 are typically incorporated into rhabdoviruses lacking the a functional glycoprotein 
gene. 

The invention thus provides medical or veterinary methods for treating a 
patient (human being or animal) infected with a pathogenic mammalian virus by 
administration to the patient of an effective amount of a replication-competent 

20 recombinant vesicular stomatitis virus which lacks a functional G glycoprotein 
gene and displays a receptor of the pathogenic virus in its coat in a manner 
sufficient to target cells infected with the pathogenic virus. An example wherein 
the pathogenic, membrane-enveloped virus is HIV and the envelope proteins added 
to the VSV are an HIV receptor and a specific coreceptor is given in the Examples 

25 section hereafter. In that illustration, a T cell tropic HIV is employed, but 

recombinant VSVs of the invention can been engineered to express coreceptors of 
other HIV strains. Other embodiments express a receptor and not a coreceptor. 
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and some recombinant VSVs of the invention express more than one HIV corecep- 
tor. 

In the practice of a preferred embodiment of the invention, a patient 
5 infected with HIV is treated by administering to the patient an effective amount of 
a replication-competent recombinant vesicular stomatitis virus which lacks a 
functional glycoprotein G gene and expresses at least one HIV receptor such as 
CD4 and at least one HIV coreceptor such as CXCR4, CCR5, CCR3, and/or 
mixtures thereof. An exemplary construct is VSVAG-CC4 described hereafter and 
10 illustrated in Figure 1 . 

Administration of a recombinant rhabdovirus of the invention to a person 
or other mammal can be via any local or systemic method known by skilled 
workers, but is preferably systemic. Systemic administration includes intravenous, 
intramuscular, or intradermal administration by sterile injections, parenteral 
IS administration, and the like, typically in combination with a pharmaceutically 
acceptable carrier and/or other adjuvant or adjunct compound that maintains 
viability of the recombinant VSV virus or other rhabdovirus, or enhances its 
effect. Combinations of therapies may also be employed. 

The amount of recombinant rhabdovirus necessary to bring about thera- 
20 peutic treatment is not fixed per se, and necessarily depends upon the severity or 
the extent of disease. Admmistration is facilitated and, in some cases additional 
therapeutic effects are provided by, the carrier. When a carrier is employed, it is 
necessary that it be inert in the sense of not inactivating the recombinant VSV or 
other rhabdovirus and in the sense of not bringing about any adverse effect to the 
25 patient to which it is administered. In most situations, doses are dependent upon 
the extent of the patient's disease, the age, weight, and clinical condition of the 
patient to be treated, the potency of the recombinant virus, the adjuvants or 
adjunct compounds (if any) employed, and the concentrations of recombinant 
rhabdovirus and other ingredients which are typically put together in association 
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with a pharmaceutically acceptable carrier. Most preferably, administration of the 
recombinant rhabdoviruses of the invention reduces the diseased cells in the patient 
so signficantly that they are barely detectable or undectable. 

It is an advantage of the invention that preferred recombinant VSVs 
5 which incorporate the receptor of one virus in the coat of another kill cells 
infected with the other virus rapidly and thus control infection. It is a further 
advantage of the invention that recombinant VSVs or other rhabdoviruses of the 
invention are genetically engineered to specific targets, and kill only infected or 
pathogenic cells. 

10 The following examples are presented to further illustrate and explain 

the present invention and should not be taken as Ihniting in any regard. 

Examples 

In the studies reported herein VSV recombinants are generated which 
exhibit a complete deletion of the VSV G gene and a substitution instead of the 
15 genes for CD4 and CXCR4. These viruses assemble particles containing both 
CD4 and CXCR4 in their envelopes and are targeted specifically to infect HIV-1- 
infected cells. 

Experunental Procedures 

Plasmid construction. Digestion with Mlul and Sphl was used to delete 
20 the entire VSV G gene from pVSV-CD4 (Schnell et al., 1996b). The resulting 
plasmid was designated pVSVAG-CD4. A plasmid encoding the CXCR4 protein 
with an HA epitope tag was described previously (Berson et al., 1996). CXCR4 
was amplified by PGR from this plasmid with VENT polymerase (Stratagene) 
using primers 5'-ACTGCCCGGGCTCGAGGTTACCATGGAGGGGATCAG-3' 
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(SEQ ID NO: 1) and 5'-AGCTGCGGCCGCTAGCTTAGCTCCCGGGAAGAGA- 
TGCG-3' (SEQ ID NO: 2). The first primer contains an Xlwl site and the second 
an Nhel site (boldface letters). The PGR product was digested with Xhol and Nhel 
and cloned into pVSV-XNl (Schnell et al., 1996a) that was digested with Xhol and 

5 Nhel. The resulting plasmid was designated pVSV-CXCR4. The G gene was 
deleted from pVSV-CXCR4 with Xhol and MM. The coding sequence for human 
CD4 was amplified from pCD4 (Shaw et al., 1989) by PGR using primers 5'- 
CCGGGTACCACGCGTACAATGAACCGGGGAGTCCCTTTTAG-3^ (SEQ ID 
NO: 3) and 5*-GGGCCCCTCGAGCGTGA TATCTGTTAGTTTTTTTCATA CTC- 

10 AAATGGGGCTACATGTCTTC-3' (SEQ ID NO: 4). The first primer contains 
an Mlul site (boldface letters); the second primer contains an Xhol site (boldface 
letters) and a VSV transcription stop-start signal (underlined letters). The CD4 
gene was inserted at the site of the G deletion to generate a plasmid that was 
designated pVSVADG-CC4. 

15 Construction of a BHK cell line expressing VSV G protein. The 

tetracycline-regulated system employed was described previously (Shockett et al., 
1995). A VSV G gene was prepared from a plasmid designated pBSG by digestion 
with Xhol, Extensions were filled in with Klenow enzyme, and the fragment 
ligated to pTet-Splice (Life Technologies) that had been cleaved with EcoRW. The 

20 resulting plasmid was called pTET-G. BHK cells on 10 cm plates (20% conflu- 
ent) were transfected with 1 fig pTET-G and 10 fig pTet-TAk (Life Technologies) 
using the GaP04 method (manunalian transfection kit; Stratagene). One day after 
transfection 0.5 fig/ml tetracycline was added to suppress G protein expression, 
which is toxic to BHK cells, and stable transfected cells were selected by adding 

25 0.75 mg/ml geneticin (G418). Ten days after transfection, cell colonies were 
isolated and, after induction by removal of tetracycline, they were screened for 
expression of VSV G protein by immunofluorescence microscopy. 

Recovery of VSV recombinants. Baby hamster kidney cells (BHK-21, 
ATCC) were maintained in DMEM (Dulbecco's modified Eagle's medium) supple- 
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mented with 5% fetal bovine serum (FBS). Cells on 10 cm dishes (-80% 
confluent) were infected at an MOI of 10 with vTF7-3 (Lawson et al., 1995). 
After 1 hour, plasmids encoding the N, P, G, and L proteins and the appropriate 
recombinant anti-genome RNA were transfected into the cells using a cationic 
liposome reagent containing dimethyldioctadecyl anunonium bromide and phos- 
phatidylethanolamine (Lawson et al., 1995; Rose et al., 1991). Plasmid amounts 
were 10 fig of the respective fiiU length plasmid (VSVAG-CD4 or VSVAG-CC4), 

3 fig pBS-N, 5 fig pBS-P, 4 fig BS-G and 2 ^g pBS-L. Supernatants of recoveries 
were filtered through 0.22 fim filters, and 5 ml was used to infect BHK-G cells on 
10 cm plates which had been induced for twelve hours by removal of tetracycline. 
Supernatants of the first passage were filtered twice through a syringe filter (0. 1 
^m pore size) to remove any vaccinia virus and the viruses were then passaged on 

4 X 10^ BHK-G cells. Supernatants (10 ml) were titered on BHK-G cells and 
frozen at -80°C. For production of virus stocks, 2 x 10^ BHK cells on 10 cm 
plates were infected for 2 h with 10 ml supernatants of VSVAG-CD4 or VSVAG- 
CC4 grown on BHK-G cells, washed three tunes with phosphate buffered salme 
(PBS) followed by addition of 10 ml RPMI medium with 10% fetal bovine serum 
(FBS) or 10 ml DMEM with 5% FBS. At 16 h after infection, supernatants were 
clarified by centrifugation and used for experiments or stored at -80 'C. 

Preparation and analvsis of viruses. For isolation of virions, a monolay- 
er of BHK cells (-80% confluent) on a 10 cm dish was infected with VSVAG- 
CD4 or VSVAG-CC4 (multiplicity of infection, MOI =5), or wild-type VSV 
(MOI =0.01). At 16 h after infection, cell debris and nuclei were removed by 
centrifugation at 1,250 x g for 5 min and virus was then purified twice by ultra- 
centrifiigation at 120,000g through a 20% sucrose solution containing 10 mM Tris- 
HCl, pH 7.5, and 1 mM EDTA. Virus pellets were resuspended and analyzed by 
SDS-PAGE (10% acrylamide). Gels were either stained with Coomassie brilliant 
blue or transferred to nitrocellulose membranes. Membranes were blocked with 
5% low fat milk powder in TBST (10 mM Tris-HCl pH 8.0, 150 mM NaCl, 
0.05% Tween™-20) for 1 h at room temperature and then incubated with an anti- 
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body directed against CD4 (sheep a-CD4, AIDS Research Reference and Reagent 
Program) or an antibody 12CA5 (Boehringer-Mannheim) directed against the HA 
epitope in CXCR4. Proteins were visualized after incubation with horseradish 
peroxidase-conjugated donkey anti-sheep IgG or horseradish-peroxidase conjugated 
5 goat anti-mouse IgG (Jackson Reasearch) for 1 h at room temperature using 
enhanced chemiluminescence (ECL; Amersham) system. 

Electron microscopy and labeling with gold-coniugated antibodies . 
Virus particles from infection of approximately 10^ cells on a 15 cm dish were 
recovered from the culture medium by first pelleting cell debris at 1,500 x g for 

10 10 min. The virus was then concentrated and purified by centrifugation as 

described above. Virus samples were absorbed onto carbon coated grids for 5 min 
and then blocked with 1 % BSA in PBS for 10 min at room temperature. The 
grids were then placed on a 50 /il drop of anti-CD4 Mab OKT4 (Reinherz et al., 
1979) diluted 1:50 in PBS containing 1% BSA. After 1 hour, excess antibody was 

15 removed by placing grids sequentially onto five SO-fxl drops of 1 % BSA in PBS 
for 2 min each time. The grids were then placed on a 50-/il drop of goat anti- 
mouse IgG (Fc) labeled with 15 nm gold particles (AuroProbe, Amersham Inc.). 
Unbound gold conjugates were removed by five sequential 2-min washes with 
PBS. The virus-immunogold complexes were then negative stained by incubation 

20 the grids for 4 min on 50-/xl drops of 2% phosphotungstic acid (ph 7.2). Excess 
stain was removed and the grids were air dried. Images of viruses were obtained 
with a Zeiss EM910 electron microscope. 

Infection and inununofluorescence microscopv of infected Jurkat cells. 
Approximately 3 x 10^ Jurkat Cells (clone E6-1, ATCC #TIB-125) were seeded in 
25 T-25 flasks and grown in RPMI medium supplemented with 10% FBS. Infection 
with HIV-1 strain IIIB was at an MOI of approximately 0.01. At five days post- 
infection, cells were superinfected with 5 ml of medium containing VSVAG-CC4. 
As controls, one HIV-infected culture was not superinfected, and a third culture 
was infected only with VSVAG-CC4. Cells were collected for analysis at each 
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time point as follows. One half of the volume of the culture was collected and an 
equal volume of RPMI supplemented with 10% FBS was added back to the 
original cultures. Cells were pelleted at 1000 rpm for five minutes at S^'C in an 
lEC Centra-7R refridgerated centrifuge (Fisher Scientific). The media was 
5 discarded, cells were washed once in 5 ml PBS, and pelleted again. The PBS was 
removed and cells were resuspended in 100 /xl PBS. The cells were then fixed in 
1 ml of 3% paraformaldehyde for one hour at room temperature, or overnight at 
4°C. After permeabilization in 1% Triton X-100, cells were stamed with a 
monoclonal antibody to VSV N protein and with human polyclonal HIV im- 
10 munoglobulin (AIDS Research Reference and Reagent Program), followed by 
rhodamine-conjugated anti-mouse and FITC-conjugated anti-human antibodies 
(Jackson Research). 

Cells were examined and photographed with 40X planapochromat 
objective on a Nikon Microphot-FX microscope equipped for epifluorescence. 
15 Infection of cells with HIV-l or VSVAG-CC4 was monitored by counting at least 
three random fields of cells that typically contained 100-200 cells. Syncytia were 
counted as single cells. 

Determination of HIV Titers. HIV titers were obtained using supema- 
tants from infected cultures and making serial three-fold dilutions in 96-well 

20 plates. Ten thousand MT-2 cells provided by the AIDS Research Reference and 
Reagent Program (ARRRP) were added to each well and the virus and cells were 
mixed by pipeting. Plates were incubated at 37*C for 3-5 days and scored for the 
presence of syncytia. Titers at each time point were performed in triplicate and 
two or more syncytia per well were counted as positive for HIV infection. 

25 Titers were calculated as described previously (Reed and Muench, 1938) and are 
reported as 50% tissue culture infectious doses( TCID50) per ml. 

Reverse Transcriptase Assavs. Each assay was performed in 96-well 
plates in duplicate (Willey et al., 1988). Twenty-five microliters of reverse 
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transcription reaction mix (0.05% NP-40, 50 mM Tris-HCl, pH 7.8, 75 mM KCl. 
2 mM DTT, 5 mM MgCla, 5 units oligo (dT)12-18, 5 mg/ml poly A) was added, 
followed by addition of 10 /il of cell-free culture supernatants. An additional 25 
of reaction mix was added containing 10 /iCi/ml Q:-p^P]-dTTP, and the reaction 

5 was placed at 37X for 90 minutes. Five microliters of each sample was then 
spotted onto Whatman DE81 paper, which was allowed to air dry. The DE81 
paper was then washed in 2 x SSC (0.3 M NaCl, 20 mM sodium citrate) buffer 
four times, each time at room temperature for five minutes, followed by two one- 
minute washes in 95% ethanol at room temperature. After air drying, the paper 

10 was exposed to a Molecular Dynamics Phosporlmager screen overnight. Samples 
were quantitated directly on the phosphorimager and by counting in a scintillation 
spectrometer after excision from the paper. Duplicate samples agreed within 5%. 

Neutralization of mfection bv VSVDG-CC4. Polyclonal sheep anti-CD4 
serum and anti-gpl20 serum (ARRRP) were heat treated at 56X for one hour to 

15 inactivate complement. The anti-VSV neutralizing monoclonal antibody (II) was a 
mouse ascites fluid. Neutralizations were performed as follows. VSVAG-CC4 
virus (1 ml derived from - 10^ infected BHK cells) was mixed with 100 ^1 anti- 
CD4 or 100 /zl anti-VSV for one hour at 37 *C, and then the mixture was added to 
- 10^ Jurkat cells that had been infected with an MOI of 0.01 HIV-1 IIIB five 

20 days previously. Alternatively, 100 /zl anti-gpl20 serum was added directly to 1 
ml of HIV-infected Jurkat cells for one hour at 37X. Then 1 ml of VSVAG-CC4 
was added to the cells. After 3 days, cells were pelleted at 1000 rpm in a clinical 
centrifuge for 5 minutes and washed once in PBS. Cells were fixed and stained to 
detect HIV and VSV N protein antigens. 

25 Results 

Recovery of VSV recombinants expressing CD4 and CXCR4. Genetic 
engmeering of non-segmented negative-strand RNA viruses such as VSV is 
complicated by the fact that the minimal infectious unit of the virus is not RNA, 
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but rather the viral RNA genome in a ribonucleoprotein complex composed of 
nucleocapsid (N) and polymerase proteins (L and P). Infectious VSV can be 
recovered from plasmid DNA as follows. Four separate plasmids expressing the 
full-length antigenomic VSV RNA and the N, P and L genes under control of 
bacteriophage T7 promoters are transfected into cells already infected with a 
vaccinia virus recombinant expressing T7 RNA polymerase (vTF7-3). Intracellu- 
lar assembly of the antigenomic RNA into nucleocapsids is followed by VSV 
replication and transcription to generate infectious, replication-competent VSV 
(Fuerst et al.. 1986; Lawson et al., 1995; Whelan et al.. 1995). Additional genes 
can be expressed from extra transcription units introduced into the VSV genome 
(Schnell etal., 1996a). 

The construction of a VSV recombinant expressing the CD4 protein in 
addition to the five VSV proteins N, P, M, G and L has been previously reported 
(Schnell et al., 1996b). To generate a recombinant expressing CD4 and CXCR4 
instead of VSV G, the upstream VSV G gene was deleted from the VSVCD4 con- 
struct (Figure 1). The recovery of virus, designated VSVAG-CD4, was then 
accomplished by including a complementing plasmid encoding VSV G protein in 
addition to those encoding N, P, and L, The gene encoding CXCR4 at the 
position downstream of CD4 was next added and the virus designated VSVAG- 
CC4 was recovered. Because the recovered viruses are capable of only a single 
round of infection on BHK cells, they were initially recovered and grown on cells 
infected with a vaccinia virus encoding the T7 RNA polymerase and transiently 
expressing a complementing VSV G protein from a transfected plasmid DNA. 

A cell line expressing VSV G protein allows growth of the defective 
viruses. In order to grow the VSVAG-CD4 and VSVAG-CC4 virus in the absence 
of vaccinia virus, a BHK cell line expressing the VSV G protein from an inducible 
promoter (Shockett et al., 1995) was generated. Figure 2 shows that the VSVAG- 
CC4 virus forms plaques on this cell line mdicating multiple rounds of infection. 
Titers obtained after growing the defective viruses on this line were in the range of 
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0.5 - 1 X 10^ pfii/ml. These titers are reduced at least 1000-fold compared to 
those of wild'type VSV presumably because the levels of G protein provided by 
the complementing cell line are low compared to those expressed in wild-type 
infection. 

5 Incorporation of CD4 and CXCR4 into the recombinant viruses. To 

determine if the CD4 and CXCR4 molecules were expressed from the defective 
viruses and incorporated into virus particles, BHK cells (not expressing VSV G) 
were mfected with VSVAG-CD4 or VSVAG-CC4 that had been grown on the 
BHK-G line (MOI = 5). After 16 hours when all cells had been killed, the virus 

10 particles were purified from the medium. Viral proteins were then separated by 
SDS-PAGE and detected by staining with Coomassie blue. The wild-type virus 
contains the L, G, N, and M proteins (P is a minor protein comigrating with N) 
while the VSVAG-CD4 and VSVAG-CC4 viruses lack G but contain a new protein 
of about 55,000 daltons, the size expected for CD4. The presence of CD4 in the 

15 particles was also verified by western blotting with sheep anti-CD4 antibody or 
monoclonal antibody 12CA5 to the HA epitope tage in CXCR4. The CXCR4 
protein was not detected in VSVAG-CC4 by staining, perhaps because it contains 
seven membrane spanning segments that does not bind the stain, but was easily 
detected by western blotting. 

20 Budding of VSV in the absence of VSV G protein is known to be 

approximately 20-fold less efficient than budding in its presence (Knipe et al., 
1977). Similar results have been obtained for rabies virus (Mebatsion et al., 1996). 
Although the amounts of protein detected by Coomassie blue staining appeared 
similar for all viruses, the virus loaded in test lanes were derived from 10^ 

25 infected BHK cells, while the wild-type virus in the control lane was from only 2 
X 10^, or 50-fold fewer infected cells to achieve similar staining. The resuhs not 
only confirm the importance of VSV G protein in driving highly efficient virus 
budding, but also illustrate that significant amounts of virus production occur even 
in the absence of VSV G protein. 
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To examine the morphology of the recombinant virus particles and to 
verify that CD4 was exposed on the viral membrane, purified virus particles were 
examined by electron microscopy after labeling with antibody to CD4 and a gold- 
conjugated secondary antibody. Photographs of negatively stained virus particles 
5 are shown in Figure 3. Wild-type VSV particles contained visible G protein 
spikes (arrows) and did not bind antibody to CD4, while the VSVAG-CD4 or 
VSVAG-CC4 viruses lacked visible spikes and did bind anti-CD4 antibodies. The 
antibody to CXCR4 used in western blotting recognizes the internal C-terminal 
HA tag on CXCR4 and therefore is not suitable for labeling virus particles. 

10 Infection of HIV-1 Infected Cells bv VSVAG-CC4. Because cells 

infected with HIV-1 have the gp 120/41 envelope protein on their surface, it might 
be possible to infect these cells with VSVAG-CC4. The gp 120/41 molecules 
present on the cell surface should bind to CD4 and CXCR4 on the virus surface 
and promote fusion of the viral and cell membranes. To test this possibility, a 

15 human T cell line (Jurkat) was infected with HFV-l IIIB. At five days post- 
infection, it was expected from previous experhnents that about half of the cells 
would be infected with HIV-1, and VSVAG-CC4 virus was added. Two control 
cultures, one infected with HIV-1 only, and another infected with VSVAG-CC4 
only, were followed in parallel. 



20 Initially infection by HFV-l and VSVAG-CC4 using indirect immuno- 

fluorescence microscopy to detect both HIV-1 proteins and VSV nucleocapsid (N) 
protein was followed. An example of results from the inmiunofluorescence 
analysis at 19 days after HIV infection is shown in Figure 4, along with a Nomar- 
ski image of the same field of cells. A compilation of the immunofluorescence 

25 data obtained throughout the experiment is graphed in Figure 5. The fields shown 
in Figure 4 illustrate that in the control culture mfected with HIV-1 only, nearly 
all cells stained positive for HIV antigens (A and B) while none of the cells stained 
positive for VSV N protein (C). In the cells infected with HIV-1 and VSVAG- 
CC4, only a small fraction of the cells were positive for HIV-1 antigens (D and 
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E), and the majority of those cells were also positive for VSV N protein (F), 
indicating that they had been superinfected with VSVAG-CC4. It was also noted 
that rare cells that were strongly positive for VSV N protein expression, had only 
little or no HIV protein evident. These cells may result from shut-off of cellular 
5 and HIV protein synthesis by VSVAG-CC4 or from other mechanisms of infection 
(see Discussion below). In Jurkat cells infected with VSVAG-CC4 only, none of 
the cells showed the presence HIV or VSV N antigens (H, I, and J). 

From the quantitation in Figure S it is evident that when cells were 
infected with HIV-1 alone, the percentage of HIV-1* cells rose to nearly 100% 

10 and remained high throughout the experiment. When cells were first infected with 
HIV-l and subsequently infected with VSVAG-CC4, the percentage of HIV+ cells 
began to decrease at day 14 and plateaued at day 21 to levels between 5 and 10% 
HIV^. In addition. Figure 5 shows that the percentage of HIV-1 + cells that were 
also infected with VSVAG-CC4 increased and then reached a plateau at around 90- 

15 95%. It therefore appears that cells infected with HIV-l are rapidly superinfected 
with VSVAG-CC4 and killed before high levels of HIV-l can be released. A 
balance is then reached between the two viruses where HIV infection is controlled 
by VSVAG-CC4 infection and maintained at low levels. Very similar results were 
obtained in an experiment performed with the H9 T cell line, another cell line in 

20 which HIV-l IIIB grows well. 

The possibility that the VSVAG-CD4 virus lacking the coreceptor gene 
might infect HIV-l infected cells was also tested. In one experiment it was 
observed that a low level of infection by VSVAG-CD4 (about 10% that observed 
with VSVAG-CC4) occurred. However, the initial infection could be completely 
25 neutralized with antibody to VSV G, indicating that a trace of G protein 
sufficient to start infection must have been present in the virions. This 
G protein was presumably carried into the BHK cell line from the input virus 
grown in the BHK-G line and then incorporated into exiting virions. Interestingly, 
once the infection was established in HIV-infected Jurkat cells, it persisted for 




wo 99/02657 



PCTAJS98/14527 



- 19- 



10 



15 



20 



several weeks, indicating that the vims could propagate without encoding its own 
coreceptor. Presumably budding virions can pick up cell-encoded coreceptor and 
are then able to propagate. However, control of HIV-1 infection was not as 
strong as with VSVAG-CC4, indicating that it is preferable to have the virus 
encode high levels of the coreceptor. Because the control of HIV infection was 
less effective without coreceptor, further studies with this virus were not pursued. 

VSVAG-CC4 infection controls release of infectious HIV-1 . To obtain 
measurements of the effects VSVAG-CC4 on production of infectious HIV-1 and 
on release of HlV-1 particles, the experiments shown in Figures 4 and 5 were 
repeated, but instead followed HIV-1 titers and release of reverse transcriptase 
(RT) into the medium (Figures 6A and 6B). At the time of addition of VSVAG- 
CC4 at three days after HIV-1 infection, HIV titers and RT levels in the medium 
were low. In the control cells infected with HIV-1 alone or in the cells super- 
infected with VSVAG-CC4, both the HIV titers and RT activity peaked at ten days 
after infection. There was then a decline in RT activity by day 14 in cells 
superinfected with VSVAG-CC4 and a leveling off at about 10% of the RT level 
seen in control cells. The results with titers of infectious HIV-1 were much more 
dramatic, with a complete disappearance of infectious HIV-1 (> 4 logs below the 
control) at day 14 followed by a fluctuation at levels of only 0-39 infectious 
particles per milliliter out to day 34. These titers should be compared to persistent 
titers of - 10*/ml for the control culture that was not infected with VSVAG-CC4. 

The RT results were consistent with what has been seen previously with 
the immunofluorescence in Figure 5, but the much larger decreases in infectious 
HIV titers were not expected. These lower titers could be explained if the 
VSVAG-CC4 infection selected for HIV-infected cells that expressed little or no 
HIV envelope and thus produced HIV particles lacking envelope but containing 



RT. 
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Anti-CD4 or anti-gpl20 block infection by VSVAG>CC4. Control 
neutralization experiments (Table 1) to verify that entry of VSVAG-CC4 into HIV 
infected cells was dependent on CD4 and gpl20 were also performed. VSVAG- 
CC4 was incubated with antibody to CD4, neutralizing antibody to VSV-G, or 
5 without antibody before addition to Jurkat cells that had been infected with HIV-1 
for five days as in Figure 5. Table 1 shows that anti-CD4 completely eliminated 
the VSVAG-CC4 infection, while antibody to G did not. 



Table 1. Neutralization of VSVAG-CC4 Infectivity 


Antibody' 


% H1V+ Cells" 


% VSV N+ Cells" 


None 


60 


39 


anti-VSV G 


59 


37 


anti-CD4 


74 


0 


anti-HIV gp 120 


71 


0 



* Viruses grown on BHK cells not expressing VSV G were incubated with anti- 
15 VSV G or anti-CD4, or cells were incubated with anti-HIV gpl20 prior to mixing 
of virus and cells. 

^ HIV positive and VSV cells were identified by indirect inmiunofluorescence 
microscopy as in Figures 4 and 5. Approximately 100 cells from random fields 
were counted for each determination. The percentage of cells positive for HIV 
20 antigens or VSV N protein is mdicated. 



Similarly, pre-incubation of the HIV-infected cells with a polyclonal anti-gpl20 
antibody completely prevented infection by VSVAG-CC4. Because CD4 can bind 
MHC class 11 on cell surfaces (Doyle and Strominger, 1987), it was conceivable 
that some low level of infection could occur after binding of VSVAG-CC4 to 
25 MHC class II on cells, although this would not be expected without a membrane 
fusion protein. To test this possibility Daudi cells, a lymphocyte cell line known 
to express abundant quantities of MHC class II protein (Till et al., 1988), was 
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used. These cells were readily infected by wild-type VSV, but no infection by 
VSVAG-CC4 was observed. 

Discussion 

A recombinant VSV that lacks its own envelope protein gene (G) and 
5 expresses instead the HIV-l receptor and coreceptor molecules, CD4 and CXCR4 
was constructed. This novel virus is defective for entry into normal cells, but can 
be propagated on cells expressing a complementing VSV envelope protein. The 
complemented virus can then be . grown on cells not expressing G protein where it 
buds particles containing CD4 and CXCR4 in the viral membrane. These viruses 
10 are replication competent, but can enter and propagate only in cells that are first 
infected with HIV-l and express HIV-l envelope protem. Infection of these cells 
by VSVAG-CC4 is neutralized by antibody to CD4 added to the virus, or by 
antibody to HIV gpl20 added to the cells. 

The results showed that in a mixed population of cells about half of 
15 which were HIV-l -infected and half were uninfected, superinfection with VSVAG- 
CC4 ultimately limited production of infectious HIV-l to levels that were barely 
detectable or undetectable, at least 300-fold to 104-fold lower than HIV titers 
produced by control cells that were not superinfected with VSVAG-CC4. This 
reduction presumably is the result of killing of the HIV-l infected cells by the 
20 VSVAG-CC4 virus. The reduction in HIV-l titers was greater than would have 
been predicted from analysis of the number of HIV-l -infected cells or from RT 
assays. The number of HIV positive cells was reduced about ten-fold by the 
VSVAG-CC4 infection, and the release of RT was reduced to a similar extent. In 
considering the mechanism of VSVAG-CC4 action, this discrepancy is reasonable 
25 since VSVAG-CC4 will target and kill those cells expressing HIV envelope protein 
and thus select for cells that produce little or no HIV envelope. Such cells would 
still score positive for HIV-l antigen production, and would produce normal levels 
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of HIV-like particles containing RT, but the majority of these particles would be 
noninfectious. 

The persistence of a low level of both HIV-1 and VSVAG-CC4 
coinfection for over one month in culture indicates that both viruses are 
5 continuing to replicate and that they have reached a balance. Based on the data, it 
seems unlikely that VSVAG-CC4 will ever completely eliminate the HIV-1 
infection in the culture. This low level persistence may occur because VSVAG- 
CC4 can only infect HIV-1 mfected cells after gpl20/41 is expressed on the cell 
surface and such cells likely release some HIV-l before they are killed. 

10 

The VSVAG-CC4 virus infects cells that have already been infected with 
HIV-1, but it may also be fusing directly with HIV-1 virions. Such fused viruses 
would contain the nucleocapsids of HIV and VSV and might be able to initiate 
infection if sufficient gpl20/41 were still present on the membrane. Such an 

15 infection would almost certainly favor replication of VSVAG-CC4 because VSV 
would shut off cellular protein synthesis in 2-3 hours, even before HIV had suffi- 
cient tune for reverse transcription and integration. Infection by these fused 
viruses could explain the presence of cells (Figure 4) that are making VSV 
proteins but do not appear to have synthesized HIV proteins. While not wishing 

20 to be bound to any theory, it is also possible that VSV has inhibited HIV protein 
synthesis in these cells. Infection of uninfected cells by VSVAG-CC4 might also 
occur if VSVAG-CC4 pseudotypes carrying HIV envelope protein were generated 
in HIV-infected cells. Incorporation of the wild-type HIV envelope into VSV 
particles is extremely inefficient (Johnson et al., 1997) and therefore this latter 

25 mechanism would likely be rare. To test directly for such pseudotypes, a short- 
term infection of Jurkat cells with supernatants from cells coinfected with HIV-1 
and VSVAG-CC4 was carried out. VSV protein expression was not detected in 
these cells, while it was readily detected by using the same supernatant to infect 
HIV-infected Jurkat cells. It was concluded that the VSVAG-CC4(HIV) pseudo- 

30 types are extremely rare if they exist at all. 
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Several aspects of this system that are worth noting. First, because 
VSVAG-CC4 is specifically targeted to cells already infected with HIV-1, the 
VSAG-CC4 infection level would be expected to parallel HIV infection and disap- 
pear with clearance of HIV. Second, VSVAG-CC4 encodes only human proteins 
5 as its envelope proteins, and thus would not induce production of neutralizing 
antibodies. The internal VSV proteins would be expected to induce cytotoxic T 
cell responses, but such responses might help in clearing HIV infection by rapidly 
killing cells infected with HIV-1 and VSVAG-CC4. Third, mutations to drug 
resistance are rapidly selected in HIV-1, but resistance to infection by VSVAG- 
10 CC4 would require loss of HIV's ability to bind CD4 or coreceptor and would 
therefore not be selected. 

The results also illustrate an important aspect of VSV budding and 
assembly. Based on virus protein yields and titers, it is esthnated that each BHK 
cell infected with wild-type VSV produces approximately 100,000 particles, and 

15 up to 10% of these particles are infectious. The deletion of the VSV G gene 
reduced particle yields of VSVAG-CC4 or VSVAG-CD4 approximately 30-50 
fold, indicating a substantial role for G in promoting budding. However, despite 
this reduced budding, at least 2,000 particles of VSVAG-CC4 or VSVAG-CD4 are 
produced from each BHK cell in the absence of G protein. The incorporation of 

20 CD4 and CXCR4 into these particles is presumed to occur by a passive process of 
protein trapping as virus buds from the cell surface, because neither protein would 
be expected to contain specific signals promoting incorporation. Because VSV 
blocks host protein synthesis and expresses extremely large amounts of its mRNA 
and proteins, CD4 and CXCR4 are likely to be the major proteins present on cell 

25 surface and therefore the major proteins in the VSVAG-CC4 envelope. 
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Tbe papers cited herein are expressly incorporated in their entireties by 

reference. 

The invention was made with partial government support under NIH 
grants AI24345 and AI49235. The government has certain rights in the invention. 



ordinary skill in the art how to practice the present invention, and it is not intend- 
ed to detail all those obvious modifications and variations of it which will become 
apparent to the skilled worker upon reading the description. It is intended, how- 
ever, that all such obvious modifications and variations be included within the 
10 scope of the present invention, which is defined by the following claims. The 
claims are intended to cover the claimed components and steps in any sequence 
which is effective to meet the objectives there intended, unless the context speci- 
fically indicates the contrary. 



5 



The above description is for the purpose of teaching the person of 
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CLAIMS 



1 . A replication-competent recombinant rhabdovinis which lacks a functional 
glycoprotein gene and encodes at least one foreign polypeptide that is expressed in 
its viral envelope. 

2. A recombinant virus according to claim 1 wherein the polypeptide is a cellular 
receptor of another virus which is expressed in the rhabdovinis in a manner 
sufficient to target cells infected with the other virus. 

3. A recombinant virus according to claim 1 which is a vesicular stomatitis virus 
which lacks a functional G glycoprotein gene and encodes at least one cellular 
receptor of a pathogenic mammalian virus. 

4. A recombinant virus according to claim 3 wherein the pathogenic virus is HIV- 
1 or HIV-2. 

5. A recombinant virus according to claim 4 wherein the pathogenic virus is HIV- 
1. 

6. A recombinant virus according to claim 4 which expresses in its viral envelope 
at least one HIV receptor, at least one HIV co-receptor, or mixtures thereof. 

7. A recombinant virus according to claim 3 comprising the additional genomes 
illustrated in Figure 1. 

8. A method for treating a patient infected with a pathogenic virus comprising 
administering to the patient an effective amount of a recombinant virus according 
to claims 1, 2, 3, 4, 5, 6, or 7 which expresses in its viral envelope at least one 
receptor of the pathogenic virus so that the patient's cells harboring the pathogenic 

S virus are targeted. 
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9. A replication-competent recombinant vesicular stomatitis virus which lacks a 
functional glycoprotein G gene and expresses at least one HIV receptor and at 
least one HIV co-receptor. 

10. A method for treating a patient infected with HIV-1 or HIV-2 by adminis- 
tering to the patient an effective amount of a recombinant virus according to 
claims 1, 2, 3, 4, 5. 6, 7, or 9. 
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(A) ADDRESS: Yale School of Medicine 

(B) STREET: 310 Cedar Street Street 

(C) CITY: New Haven 

(D) STATE: Connecticut 

(E) COUNTRY: United States of America 

(F) ZIP CODE: 06510 

(v) COMPUTER READABLE FORM: 



(A) MEDIUM TYPE: 3.5" 1.44 Mb diskette 

(B) COMPUTER: IBM PC 

(C) OPERATING SYSTEM: MS DOS 
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(vi) 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: DNA 

(ix) FEATURE: 

(A) NAME/KEY: primer used in constructs 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ACTGCCCGGG CTCGAGGTTA CCATGGAGGG GATCAG 36 



(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: DNA 

(ix) FEATURE: 

(A) NAME/KEY: primer used in constructs 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AGCTGCGGCC GCTAGCTTAG CTCCCGGGAA GAGATGCG 38 



(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: DNA 

(ix) FEATURE: 

(A) NAME/KEY: primer used in constructs 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCGGGTACCA CGCGTACAAT GAACCGGGGA GTCCCTTTTA G 41 

(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: DNA 

(ix) FEATURE: 

(A) NAME/KEY: primer used in constructs 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGGCCCCTCG AGCGTGATAT CTGTTAGTTT TTTTCATACT 40 
CAAATGGGGC TACATGTCTT C 61 
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